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INTRODUCTION. 


It  is  the  purpose  of  this  thesis  to  study  one  of  the 
common  mathematical  problems  of  the  physicist,  astronomer, 
engineer  and  statistician,  namely  to  derive  the  equation  of  a 
curve  which  represents  graphically  certain  observations  or 
experimental  data.      The  point  of  view  differs  radically  from 
that  of  purely  theoretical  treatment  in  which  one  finds  that 
thru  any  n  points  arbitrarily  chosen  he  may  pass  a  curvo  of 
any  form  whatever  provided  its  equation  contains  n  constants. 
The  problem  here  is:-  Given  n  points,  to  derive  with  a  mini- 
mum of  labor  an  equation  of  the  simplest  form  possible  with 
as  few  constants  as  possible ,  whose  locus  shall  fit  these 
points  nearly  enough  for  the  purpose  for  which  it  is  to  be 
used. 

The  common  and  almost  universally  accepted  method  of 
dealing  with  this  problem  is  the  method  of  least  squares. 
This  method  has  the  advantage  that ,  given  a  set  of  data  and 
an  equation  containing  certain  undetermined  constants,  it  so 
determines  these  constants  as  to  make  the  sum  of  the  squares 
of  the  residuals  a  minimum.      It  has  the  disadvantage  of  being 
impracticable  in  many  cases,  and  even  in  those  cases  where  it 
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is  most  applicable  very  laborious,  and  hence  if  another  method 
can  be  found  which  will  give  results  as  accurate  as  one's  da- 
ta and  do  it  with  less  labor  it  is  for  that  set  of  data  a  bet- 
ter method.      Moreover  the  method  of  least  squares  becomes  in 
some  cases  where  the  undetermined  constants  are  not  linearly 
involved  hopelessly  cumbersome  by  leading  to  equations  which 
can  not  be  solved.      It  is  one  thing  to  say  that  n  equations 
can  be  solved  for  n  unknowns  and  another  and  quite  different 
thing  to  solve  them,  as  for  instance  in  many  exponential  and 
transcendental  equations.      Probably  after  the  common  parabo- 
lic form 

y  =  aQ  ■*■  a-j_x      ao*2  +  +  anxn 

there  are  few  forms  more  commonly  useful  than  the  sine  curve 
and  the  form  with  undetermined  exponents.       But  if  the  period 
of  a  sine  curve  or  the  unknown  exponent  is  one  of  two  or  more 
constants  to  be  determined  the  method  of  least  squares  fails. 
Hence  there  is  certainly  need  for  some  systematic  method  of 
fitting  curves  of  these  forms.      Moreover  the  relative  ease 
with  which  the  constants  in  the  common  parabolic  form  may  be 
determined  by  the  method  of  least  squares  has  led  to  its  in- 
discriminate use  in  cases  where  some  other  form  of  function 
might  he  more  advantageously  assumed.      Certain  cases  where 
other  curves  are  useful  and  methods  of  fitting  these  to  data 
will  be  presented.      While  there  is  no  pretension  to  ex- 
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haustive  treatment,  it  is  hoped  that  some  of  these  problems 
and  principles  may  be  suggestive  and  useful. 


THE  METHOD  OF  MOMENTS. 


The  method  of  moments  as  a  method  of  curve  fitting 
is  due  to  Prof.  Karl  Pearson  and  has  been  published  by  him  in 
the  Philosophical  Proceedings  of  the  Royal  Society,  in  Bio- 
metrica  and  elsewhere. 

lit 

Considered  a  priori,  in  an  intuitional  rather  than„a 
mathematical  way,  there  would  seem  to  be  some  reasonableness 
to  the  method.      There  are  given  n  points.      If  they  conform 
to  some  law  capable  of  expression  in  an  equation,  the  problem 
is  to  find  that  equation.      If  the  equation  is  to  give  a  per- 

-fke 

feet  fit,  thenA theoretical  curve  and  the  experimental  curve 
together  with  certain  fixed  ordinate s  at  x  at  a  and  x  =  b  and 
with,  the  x-axis  bound  the  same  area. 

But  an  indefinitely  large  num- 
ber of  curves  can  be  drawn  which  will 
give  the  same  /.  area  under  the  curve 
between  the  same  limits  as  in  the 
accompanying  figure.       If  the  first 
moments  of  the  theoretical  and  obser- 
cfx  *    vational  curves  are  equal,  then  the 

sums  of  the  products  obtained  by 


\ 
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multiplying  each  element  of  area  by  its  distance  from  an  arbi- 
trarily chosen  axis,  that  is , Jxy  dx    in  the  two  curves  are 
equal.      This  will  give  a  condition  as  to  the  distribution  of 
the  area  along  the  x-axis.      If  second  moments,  that  is, 
Jx^y  dx    are  equal,  there  is  a  second  condition  on  this  distri- 
bution of  area  along  the  x-axis.      If  in  this  way  as  many  con- 
ditions be  put  upon  the  theoretical  curve  as  it  contains  un- 
determined constants  it  might  seem  that  a  method  of  curve- 
fitting  were  here  set  up.       It  remains,  however,  to  prove  ma- 
thematically that  this  method  of  procedure  actually  will  fit 
a  curve  to  observations,  and  to  show  what  degree  of  accuracy 
may  be  expected  under  different  conditions  as  to  data,  and 
form  of  function  assumed. 

Given  a  series  of  observations  of  a  variable  y, 
corresponding  to  a  series  of  values  of  a  second  variable  x, 
to  show  that  the  method  of  moments  furnishes  a  good  method  of 
fitting  to  this  data  a  theoretical  curve 

y  =  f(x,olfc2,  cn) 

where  the  c's  are  arbitrary  constants.      Let  it  be  assumed 
that  the  function  can  be  expanded  by  Taylor's  theorem  and  con- 
verges fairly  rapidly.      Let  the  expansion  be 

y  =  f(0)  *  f(0)x  t  1     (°)X-4  ^      -  -  - 

31 


aox2  a*x3 
=  an  f  o,x  t      —  ■*■  ~ — 

1         2 !  31 
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Each  a  will  be  a  function  of  the  c's  of  the  original  equation. 


is 


Kence  when  each  of  the  first  n  a' s^ expressed  in  terms  of  the 
n  c's,  there  arise  n  equations  which  may  be  solved  for  the  c's 
in  terras  of  the  first  n  a's.      Then  the  succeeding  a's,  i.e., 
those  after  an_-j.  may  be  expressed  first  in  terms  of  the  c's, 
then  by  substitution  in  terms  of  the  first  n  a's. 
Then 


y  =  aQ  +  a-j_x  +  aPx" 


+  _  — 


an-lx 


n-1 


21  (n-1)! 
■p(nh  «    o  ,       wn  .  x.(n*l)/_  0  un*l 

(n*l)«. 


nl 


Let  y'  be  the  observed,  y  the  theoretical  value.      Then  y  -  y' 
is  the  difference  between  the  observed  and  theoretical  curves 
at  a  point  corresponding  to  a  given  x.       If  /(y  -  y' )2  dx  be 
made  a  minimum,  y  -  y'     ill  be  made  smaUT,      It  would  not  be  of 
much  service  to  make/if  -  y')  dx  a  minimum  for  even  if 
f\ If  "  y1 )  dx  =  0,  it  only  means  that  the  areas  under  the  curves 

are  equal ,  and  there  would  be  nothing 
to  exclude  curves  like  those  in  figure, 
y  -  y'  is  a  part  of  the  time  positive , 
the  other  part  negative  and  the  two 
quantities  exactly  cancel  each  other, 
although  the  curves  are  entirely  differ- 
ent.      But  (y  -  y' )     is  necessarily 


positive  and  since        y  -  y' )^  dx  involves  the  summing  of  all 
the  necessarily  positive  products  of  (y  -  y')^    and  dx ,  making 
jiy  -  y' )2  dx    a  minimum  is  bound  to  bring  the  theoretical  and 
experimental  curves  as  near  together  as  the  given  data  and  the 
assumed  form  of  equation  will  permit. 

To  mal\e  J*{  y  -  yf  )^  dx    a  minimum,  its  first  derivative 
must  be  zero.      This  makes 

/(y  -  r  )  eftr  dx  =  o 
for  y  is  the  variable  and  differentiation  under  the  integral 
sign  can  be  performed  by  ordinary  method  of  differentiation  if 
the  limits  of  integration  are  independent  of  the  quantity  with 
respect  to  which  the  differentiation  is  performed.     /y  is  the 
variation  in  y  due  to  the  variations  in  the  coefficients  b,q  ,  a^  , 

an-l* 

fnj  -  foJ  I  /7 

+/&*..g»M  -ft  +SL 

A,XW  ^f^+'lr  =>-fC"+yJ  <zfr"+'Js  ) 


+  


=      (,+         *  s£t?'!z?+  -  -  -J 
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+  at ^"V^  _  -  -J 

+  

Let    Pi  =  fn(Qx)x^    bo  the  remainder  after  n  terms.  Then 

nl 

0<©<1.      Also  the  coefficient  of  any  one  of  the    a's  after 
the  first  term  is  the  partial  derivative  of  R  with  respect  to 
the  a  to  which  that  particular    a  belongs. 

Substituting  the  value  of    <cfy  in 
and  collecting  coefficients 


o 


Since  &Qf  _  _  _  ^2  are  independent  of  each  other,  i.e., 

since  we  may  vary  the  coefficients  independently  of  each  ether, 
this  equation  must  be  an  identity  and  each  coefficient  is 
equal  to  zero. 


9 


ffy-yJfg+W*-0 

Let  A  be  the  area .  AM^ ,  AMg ,  -  -  -  AM^j  the  first  (n-1)  mo- 
ments of  the  theoretical  curve,  A,  the  area  and  A*M£,  A'Mf2, 

-  -  -  A'M1  ,     like  moments  of  the  observation  curve  about  the 
n-1 

y-axis  which  of  course  may  be  placed  at  any  point  where  it  may 
bo  found  convenient. 

Performing  the  indicated  integrations  so  far  as  possi- 
ble ,  and  clearing  of  fractions,  gives 


II 


10. 


The  term  involving  the  integral  will  be  small  for:- 

1.  It  involves  the  y-y?  which  is  snail  because 
/(y-y'.^dx  has  been  made  a  minimum. 

2.  R  a  3Lm  fn(ex)  is  by  hypothesis  small  if  n  be  not  very 

n; 

small,  for  the  series  was  rapidly  convergent. 

3.  If  the  first  n-1  moments  of  a  curve  are  equal  succeed- 
ing moments  become  more  and  more  nearly  equal  the  larger  n 
becomes,  for  we  put  more  and  more  conditions  on  the  distributiai 
of  the  equal  total  areas  along  the  x-axis.  But 


vanishes  if  the  higher  moments  are  equal. 


+  

But  A  =  A1     and  the  factors    IvL,-M '  .  M    --Ml  ,,     are  small  be- 

n    n'    n+1  n+1 

.cause  the  higher  moments  of  two  curves  having  their  first  n-1 
moments  equal  are  nearly  equal. 

Neglecting  in  II  the  term  which  contains  these  three 
small  factors  gives 


11 


A    =  A 


AM. 


A'M-j 


III 


AI', 


2  =  A,;C 


AM. 


if  theexpansion  by  Taylor's  Theorem  has  a  finite  number  of 
terms,  n,  as  in  common  parabolic  curve    R  =  0,  and  this  is  not 
an  approximation  but  is  perfectly  accurate.      That  is,  the  n 
constants  of  the  theoretical  curve  are  determined  by  equating 
its  area  and  first  n-1  moments  to  the  area  and  first  n-1  mo- 
ments of  the  observed  curvo .      This  gives  n  equations  to  deter- 
mine the  n  constants. 


that  the  fundamental  difference  lies  in  the  choice  of  the  quan- 
tity minimized.  In  least  squares  it  is  the  sum  of  the  squares 
of  the  distances  between  the  theoretical  and  observed  curves 
measured  along  the  ordinates  of  the  points  that  represent  the 
plotted  data.  By  the  method  of  moments  the  quant Ity  minimized 
is  J(y-y)^  dx ,  that  is,  the  sum  of  the  squared  of  all  the  dis- 
tances between  the  curve  measured  on  every  ordinate.  For  this 
reason  the  method  of  least  squares  aims  at  getting  a  curve  that 


Comparing  the  method  with  least  squares  method  shows 
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will  strike  all  the  observed  points  as  well  as  possible,  while 
the  method  of  moments  aims  at  fitting  the  curve  to  a  hypothe- 
tically  existing  curve  which  is  the  expression  of  the  law 
sought  and  whose  area  and  moments  are  the  same  as  those  of  the 
observed  curve.      It  will  thus  tend  to  smooth  out  any  irregu- 
larities in  the  data  and  while  it  may  not  come  so  close  to 
some  irregularly  placed  point, may  by  that  very  fact  better  ex- 
press the  law  involved  in  the  data,  just  as  it  is  universally 
recognized  that  a  smooth  curve  drawn  through  a  number  of  point 
representing  experimental  data  is  a  better  expression  of  the 
law  than  the  polygon  made  by  connecting  up  the  points.  The- 
oretically there  is  no  limit  to  the  degree  of  exactness  of  fit 
ting  any  set  of  points.      The  curve  may  be  chosen  with  more 
and  more  constants  and  by  equating  areas  and  moments  as  many 
equations  are  formed  as  there  are  constants  to  be  determined. 
Practically  the  limit  is  found  in  the  rapid  increase  of  proba- 
ble error  in  the  higher  moments,  as  the  number  of  dimensions 
increases. 

The  three  essentials  for  the  use  of  this  method  are 

then:- 

a.  We  must  be  able  to  ascertain  the  moments  of  the  theo- 
retical curve  in  terms  of  its  constants. 

b.  We  must  know  hdw  to  find  area  and  moments  of  any  sys- 
tem of  observations,  i.e.,  quadrature  formulas. 
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c.  The  expressions  for  moments  of  the  theoretical  curve 
in  terms  of  its  constants  must  give  equations  capable  of  numeri- 
cal solution. 

As  to  the  form  of  function  to  "be  assumed  in  any  case 
it  may  he  said  that  for  frequency  curves  Pearson  has  worked 
out  a  set  of  five  types  of  curves  derived  from  the  point  bino- 
mial and  the  theory  of  probability. 

(Philosophical  Transactions  of  the  Royal  Society,  Volume  186A 
pp.  344-414.) 

Merely  as  illustrating  and  applying  one  of  the  sim- 
pler forms  which,  Prof.  Pearson  has  used  in  his  unified  and 
systematic  treatment  of  frequency  curves  in  general  a  problem 
in  school  attendance  will  be  fitted  with  a  point  binomial. 
But  the  paper  is  in  the  main  concerned  with  the  other  class  of 
problems,  those  belonging  to  the  physicist  and  engineer  rather 
than  the  statistician.      Here  obviously  the  forms  of  curve 
will  be  very  different  from  those  occurring  in  statistical 
work.      In  some  cases  the  form  of  the  curve  is  dependent  upon 
some  known  physical  law  which  leaves  only  certain  constants  to 
be  determined,  as  the  coefficient  of  discharge  or  coefficient 
of  velocity  in  a  nozzle  of  a  fire-hose.      Sometimes  a  mere  in- 
spection of  the  plotted  points  will  suggest  a  type  of  function, 
as  for  instance  a  manifest  periodicity  may  give  the  general  ef- 
fect of  a  sine  curve.      But  in  many  cases  the  assumption  of  the 
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form  of  the  function  will  be  more  or  less  arbitrary,  and  some- 
times two  quite  different  forms  of  function  appear  to  give 
equally  good  or  equally  bad  fits  over  a  certain  rather  limited 
range  of  observations.      A  discussion  of  forms  of  functions 
suited  to  certain  data,  based  on  the  behavior  of  successive  or- 
ders of  differences  will  be  given  in  section  at  the  ond  of  this 
thesis . 
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ON  FREQUENCY  CURVES. 


/  It  is  a  familiar  fact  in  the  theory  of  probability 

that  the  kth  term  of    (1/2+ 1/2 )n    =  (l/2)n4-  n(l/2)nf- 

n(n-l )(l/2)n  f    j-(i/2)n      represents  the  probability  of 

21 

throwing  k-1  heads  and  (n-k-M)  tails  in  n  throws  of  a  coin. 
Also  that  for  fairly  large  values  of  n  as  from  10  to  20,  if 
these  numbers  be  plotted  as  ordinate s  at  equal  intervals  along 
the  axis  of  abscissae  the  points  lie  very  close  to  the  normal 
frequency  curve    y  =  l&F*  where  <?  and  k  are  constants 

dependent  on  n.      The  sum  of  all  the  terms  is  obviously  unity, 
corresponding  to  the  certainty  that  some  one  of  these  events 
must  happen.      Hence  if  rectangles  be  constructed  of  unit  base, 
and  altitudes  equal  to  the  successive  terms  of  the  expansion, 
the  area  of  the  kth  rectangle  represents  the  probability  of  the 
occurrence  of  k-1  heads  and  (n-k+1)  tails  in  n  throws.  Thus 
if    n  =  10  the  successive  rectangular  areas  from  left  to  right 
represent  the  probability  of  throwing  0.  1,  2,  3,  -  -  -,  10 
heads  in  10  throws.      The  curve  in  dotted  lines  represents  ap- 
proximately this  sane  area  under  a  continuous  curve  very  like 

y  =  Jce-*2/2^. 


This  represents  a  case  where  the  probabilities  of  the 
two  elementary  events  namely  that  any  coin  fall  as  a  head  or  as 
a  tail  are  equal,  hence  the  probabilities  of  a  heads  and  b 
tails  is  just  equal  to  the  probability  of  b  heads  and  a  ta.ils 
and  the  curve  is  symmetrical. 

If,  however,  we  consider  the  probability  of  throwing 
an  ace  with  a  single  die  it  is  l/6.      By  the  same  theory  as  the 
preceding  the  kth  term  of    (5/6-*-l/6)n    represents  the  probabi- 
lity of  throwing  k-1  aces.      This  manifestly  will  give  an  asym- 
metrical curve  if  plotted  in  the  same  manner  as  before.  And 

the  type  of  curve  formed  by  plotting  as  ordinates  Ax  successive 

c 

terms  of    (p-^q)n    where    p  +  q  =  1    and  A  is  the  area  under  the 
curve,  at  intervals  c  along  the  x-axis  is  used  by  Prof.  Pearson 
to  fit  data  of  the    nature  of  frequency  curves  which  have  lim- 
ited range  in  both  directions  with  skewness.      A,  the  area,  re- 
presents the  total  number  of  observations. 

Problem-  To  fit  the  best  point  binomial  curve  to  this 
set  of  data.      Number  of  children  in  Boston  school  by  ages. 

Age.        Number.  Age.        Number.  Age.  Number. 

3-  4  205  9-10  7908  15-16  3800 

4-  5  1765  10-11  7805  16-17  2301 

5-  6  5101  11-12  7526  17-18  1414 

6-  7  7513  12-15  7198  18-19  747 

7-  8  777  9  13-14  6862  19-  415 

8-  9  793  9  14-15  5465 
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Plotting  the  data  of  the  school-attendance  problem  shows -asym- 
metry or  skewness  hence  the  applicability  of  the  point  binomial 
curve  with  p  /  q, 

Let  the  figure  be  considered  as  a  series  of  rectangles 
n+1  in  number  of  base  c  ,  and  the  altitudes  a/c  times  successive 
terms  of    (pf-q)n,  that  is,  the  rth  altitude 


=  A  n(n-l)  ( n-r+2 )  T.n-r-Mnr-l 

c  (r-l)i  p  q 


The  area  of  any  one  of  the  elementary  rectangles  = 


cv    =  A  n(n-l)  (n-rf2)  ^n-rf-lqr-1 

cyr      A  (r-l)l  P  q 


Y 


<-£->  < — £ — >  <— 2 — »  < — c — »  < — c — >  <— £- 


X 


Draw  a  y-axis  1/2  c  to  the  left  of  the  initial  point  of  the 
first  rectangle.      If  then  instead  of  multiplying  each  element 
of  area  by  the  nth  power  of  its  x  coordinate  and  adding  the  pro- 
ducts by  A. xny  dx    to  get  the  nth  moment  we  multiply  the  nth 
power  of  the  average  value  of  x  by  the  whole  area  we  get  a  fair 
approximation  for  the  value  of  the  nth  moment. 


IS 


Using    AMq,  AM^  ,  ALIg ,  -  -  -  AM      to  represent  the  a- 
rea  and  first  n  moments,  we  get,  since  the  distance  is  in  the  rth 
rectangle  is  rc, 

y-  y.  J 

for  A  in  the  value  of  cyr  and  cs  in  (rc)s  is  a  constant  factor 

n 

common  to  every  term  and  p    the  remaining  factor  of  the  first 
area  is  multiplied  by  the  sth  power  of  its  value  of  r^ which  is 
unity,    npn~1q    by  sth  power  of  its  value  of  r,  and  so  on 
throughout  the  expansion. 

Substituting  value  of    p  =  1-q 


-h  {*-')  s  "J 

Collecting  like  powers  of  q  and  afterward  simplifying  the  coef- 
ficients 

1  £3  J 


7         &  &  ^  - 

&  /  J 


19. 

4 

J 

Using  this  formula  to  derive  the  first  four  moments  necessary 
to  determine  the  four  quantities  p,  q,  n  and  c  of  the  point 
binomial , 

-  Ac(l +  nq) 
AM    =  Ac2£i  +•  3nq  +  n(  n-1 )  q^J 

AfiL  =  Ac5  £l  t  7nq  i-  6n(  n-1 )  q2  -h  n(  n-1 )  ( n-2  )  q3J  IV 
AM4  =  Ae4[l+15nq+ 25n(n-l)q2^10n(n-l)(n-2)q3 
f  n(n-l)(n-2)(n-3)q4J 
for  the    ~&+l)st  and  all  succeeding  polynomials  vanish  identical- 
ly. 

But  the  moments  are  taken  about  an  axis  distant  1/2  c 
from  initial  point  of  the  first  rectangle  and  c  is  an  unknown. 
In  order  there  that  we  may  take  moments  of  the  observed  curve 
and  of  the  theoretical  curve  about  same  axis  some  new  axis  com- 
mon to  both  must  be  determined.      Since  obviously  the  centroids 
of  the  areas  must  coincide  a  centroidal  axis  of  moments  is  con- 
venient.     Since  this  change  of  axis  is  often  necessary,  formu- 
las are  here  derived  for  transfer  of  moments  from  any  axis  to 
any  other  and  then  a  specia  1  and  somewhat  simplified  form,  for 
the  case  when  the  new  axis  passes  through  the  centroid. 


20. 


X 


Problem.-  To  find  nth  moment  of  area  abed  about  yf -ax- 
is in  terms  of  its  moments  about  the  y-axis. 
Let    Al,In    be  nth  moment  about  y-axis. 

AM1     be  nth  morr.ent  about  y'-axis. 
n 


be  nth  distance  between  these  axes. 


Then 


/7 


*7 


Dividing    out  A 

since  Mq  is  always  unity  and  is  introduced  merely  for  symmetry 
of  formulas. 

If  now  the  y'-axis  be  centroidal ,  then  by  definition 
the  first  moment  vanishes  and 


But 


K  = 0 

0  =  U±  -  d 
d    =  M, 


which  is  ordinary  method  of  getting  x. 


I 


21. 

Ill  ~  Mo  -  SdM1  *h  d2 

V 

=  M2  -  d2 

which  is  ordinary  formula  used  in  mechanics  for  transferring 
moments  of  inertia  to  centroidal  axis. 

In  general,  this  equality  of    d  =  M-,     will  shorten 
the  formulas  by  combining  the  last  two  terms,  and 


M'  =  M,  -  3dM«  +■  2d° 

L1J  =  1vIa  ~  4aM*;      6d2Mp  -  3d4 
Applying  these  formulas  to  equations  IV 
d  =  Mn   =  c  ( 1  +  nq ) 
M*  =  0 

M'  =  ML  -  d2 

2  2 

=  c2[l-f-3nq-f-n(n-l)q2J  -  c2  jl  +  2nq  +  n^q2] 
■      jjiq-nq2/  =  c2nq(l-q)  =  c2npq 
M'  =  c3J[l  +  7nq+-  6n(n-l)q2-f-  n(  n-1 )  ( n-2 )  q2] 

-  5c5£l  +■  ono  +  n(n-l  )q^J[l  -f  nqj 
+  2c3[l"f-nqj  " 

=  c3  [l  +  7nq  -h  (  6n2-  6n )  q2  +  ( n3-3 n2  +  2n )  q2] 

-  c3  [o  -J-  12nq  -t-  ( 12n2-3n) q2  +  ( 3n3-3n2 )  q3  J 
+  c3  [2  +  6nq  -/-6n2q2  4-2n3q3J 

=  c3  £nq-3nq2  +•  2nq^J 

-  c3nq  [l-3q  +  2q2J   =  c3nq(  1-q )  ( l-2q) 
=  c3npq(p-q) 


VI 


22. 


M'  =  c4/~l-*-15nq-*-25n(n-l)q2+-10n(n-l)(n-2)q3-f-  n( n-1 )  ( n-2 )  ( n-3  )q4 
4  u 

-4c4[l  +  7nq  +  6n( n-1 ) q2+  n( n-1 ) ( n-2 ) q5][l  +  nql 
+6c4/l  +  3nq+  n(n-l)q2][l  +  2nq  +  n2q2J 
-5c4[l+  4nq-f- 6n2q2+ 4n3q3-f-  n4q4J 
=  c4 [l  +  15nq  -f  ( 25n2-25n) q2  +  ( 1 0n3-3 On2-*-  20n) q~ 
+  ( n4-  6n3+  lln2-  6n )  q4J 
-c4  j4  +  32nq+  ( 52n2-24n)q2  +  (28n3-3  6n2  +  8n)q 

■M4n4-12n3+8n2)q4] 
4-c4  [6  +  3 Onq  +  ( 48n2-  6n )  q2  +  ( 3 0n3-12n2 )  q3  -h  (  6n4- 6n3  )  q4 J 
-c4[3  +  12nq  +  18n2q2-h  12n3q3  -f-  3n4q4j 
=  c4[nq-*-(3n2-7n)q2-f-  (-Gn2-f-12n)q3  V- (3n2-6n)q4J 
=  c4nq[l  +  (3n-7)q  -H(-6n  -H2)q2-*-  (3n-6)q5J 
=  c4nq(l-q)[l + (3n-6)q-:3n-6)q2j 
=  c4npqjl  +■  3(n-2)q(l-qjJ 
=  c4npq£l  +  3  ( n-2 )  pq  J 

There  are  now  four  equations  to  determine  four  unknown  elements 
of  the  point  binomial. 


p+q 


p 

M'  =  c~npq 

VII 

I.J!.  =  c°npq(p-q) 


M»  = 
4 


c4npq  Jl  +  3  ( n-2 )  pqj 


23. 


Eliminating  c , 


(Ma)  _  nfog 


Eliminating  n, 


Substituting  in  (1) 

(fee/  .  fai)* 


M*  "  "WWW* 


Dividing  out  (M* )      and  eliminating 

where  obviously  the  sum  of  the  roots  of  the  quadratic  =  1. 
Hence  one  root  may  he  taken  for  q  and  the  other  for  p.  Hence 


pq  =  k,  constant  term  of  the  equation  and    p-q  =  5  from 

a 

theory  of  quadratic  equations    ,  

From  VII 


«      ML  =r  &{MzJ3 


The  area,  and  moments  of  the  observation  curve  are  computed  on 
the  same  assumption  as  are  those  of  the  theoretical  curve. 
The  data  is  represented  by  the  figure-  of  plate  I.      Each  space 
^.along  the  x-axis  represents  a  year  of  age  and  erecting  on  each 
a  rectangle  whose  ordinate  is  the  number  of  children  of  that  a^e 
in  the  Boston  schools,  the  area  =  xy  =  y  =  number  pupils  of  that 
age  in  the  schools.      Then  the  sum  of  these  y's  gives  the  total 
area  under  the  curve.      If  the  area  be  considered  concentrated 
along  the  mid-vertical  and  origin  taken  at  the  age    2  1/2 
years,  then  the  sum  of  the  products  xy  gives  first  moment,  the 
sum  of  the  products  x^y  ,  the  second,  and  so  on  up  to  the  num- 
ber needed  for  the  problem,  in  this  case  the  first  four. 


Atte ndanc  e  p.y  Qqe& 

in 

Boston  Public  Schools, 

-rii  Olpservafios. 
-©-Point  Binomial. 

Generalized  Proi?aJ?/J/ty  Curve. 

y=  iocf'(-n9.?il)(^3(o.?i*Lr'3^e2  3IX 


n     j?  20 


PI  ate  I. 


4 


25 


X 

y 

xy 

x2y 

x3y 

x4y 

205 

1 

205 

205 

205 

205 

205 

2 

1765 

3530 

7060 

14120 

28240 

3 

5101 

15303 

45  909 

137727 

413181 

4 

7513 

30052 

120208 

480832 

1923328 

5 

7779 

38895 

1 94475 

972375 

4861875 

6 

7939 

47634 

285804 

1714824 

10288944 

7 

7908 

55356 

387492 

2712444 

18987108 

8 

7805 

62440 

499520 

3096160 

31969280 

9 

7526 

67734 

609606 

5486454 

49378086 

10 

7198 

71960 

719o00 

7196000 

7] 080000 

11 

68C2 

75482 

830302 

9153322 

100466542 

12 

5465 

65580 

766960 

9443520 

113  322240 

13 

3800 

49400 

642200 

8348600 

108531800 

14 

23  01 

32214 

450996 

6313  944 

88395216 

15 

1414 

21210 

518150 

4772250" 

71583750 

-1-  u 

747 

11  952 

1 91232 

3  05  9712 

48955^92 

17 

415 

7055 

119935 

2058895 

34661215 

81743 

656022 

6209854 

65823384 

755746402 

Mi 

;0  =  A  = 

8174o 

AA 

k  = 

65  6022 

n 

•2  = 

6209854 

Al. 

-3  = 

65823384 

AI. 

l4  = 

755746402 

The  M*s  are  found  by  division  and  by  definition  of  centroid 
MU  =  distance  of  the  centroid  from  thea.ssumed  axis.  Trans 
ferring  moments  to  a  centroidal  axis  by  the  formulas  VI, 
Mz  =  Mz  -at*  -  tt.  ^6/0 


< 


Substituting  in  equation    (VIII)  ,  the  constant  term  =  pq  = 
.239766  whence 

p  =  .601 

q  =  .399 

Substituting  for  other  values  in  formulas  IX  and  X, 

c  =  4.287 
n  =  2.632 

All  the  elements  of  the  point  binomial  are  now  known  and  the 
terms  are , 

The  next  tern  would  contain  the  negative  factor  n-3  and  hence 
can  not  be  used.      Calculating  these  terms  by  logarithms  gives 
them  in  order    4993,  8723,  4726,  661.      To  determine  the  po- 
sition of  these  terms  to  the  right  or  left  of  the  centroidal 
axis,  it  is  remembered  that  the  first  assumed  axis  of  the 
theoretical  curve  was    1/2  c    to  the  left  of  the  mid  vertical 
of  the  first  rectangle.      But  the  axis  was  moved    c(l  +  nq)  to 
the  right.      Therefore  the  n id-vertical  of  the  first  rectangle 
is    c(l+nq)  -  c  =  cnq  =  4.502    to  the  left  of  the  centroidal 
axis.      But  this  centroidal  axis  lies  at  the  age    2 .5  4-8.025  = 
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10.626.      Hence  the  first  term  is  to  be  plotted  at  6.023. 
The  others  are  at  equal  intervals    c  =  4.287    and  lie  at 
10.310,  14.597,  18,884.      The  curve  does  not  in  the  figure  ap- 
pear to  give  a  very  good  fit  at  the  top  but  the  area  in  the 
four  rectangles  of  the  theoretical  curve  =  81895. 

observed  curve  =  81745 
difference  =  152 

which  is  less  than  1/5  %.      It  is  not  possible  here  to  compare 
individual  ordinate s  for  the  curve  is  not  continuous  as  a  curve 
of  the  form    y  =  f(x).      A  smooth  curve  can  of  course  be 
sketched  in  connecting  the  computed  points  as  in  the  figure. 

It  is  desired  now  to  pass  from  this  point  binomial 
curve  to  an  ordinary  analytical  geometry  curve     y  =  f(x). 

As  an  illustration  of  the  method  the  normal  probability  curve 
2  /  ° 

y  =  ce~x  /2c*      will  be  derived  from  the  symmetrical  point  bi- 
nomial, to  which  its  resemblance  is  well  known,  and  then  an 
analogous  method  will  be  taken  to  derive  the  generalized  pro- 
bability curve  from  the  asymmetrical  point  binomial.      An  ex- 
pression for  the  ratio  3^°.Pe   in  terms  of  x  is  found, 

mean  ordinate 

and  the  resulting  differential  equation  solve  for  the  relation 
of  x  and  y. 

The  rth  term  of  tho  symmetrical  point  binomial  is 
y  =  K  tUhzli  fa-r+ej  fJL J  " 

The  next  term  is 
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If  these  ordinates  "be  plotted  at  equal  intervals  c  along  the 
x-axis,  the  slope  of  the  line  joining  the  tops  of  the  ordinates 

is  Jkislr. 

The  mean  ordinate  is   rrr'    < r- 

siope   ~    y^*/  ~ y'r 


Lean  Ordinate  fyrf  +)rJ  C 

But  cr  =  x      and    c(r-f-l)  =  xpf ^  •      And  in  a  continuous  curve 

the  slope  is    dy_    and  the  mean  ordinate  is  y. 
dx 

If  x  represent  the  mean  abscissa  as  y  the  mean  ordinate 
Making  the  linear  substitution 

which  merely  moves  the  y-axis    c(nf  2)     to  the  right  and  so, 

p 

of  course,  does  net  affect  the  shape  of  the  curve,  and  writing 

y  Jx  a* 

Integrating 

/oa.  y  =s  -  2^1  +  /< 

The  generalized  probability  curve  is  derived  from 
the  asymmetrical  point  binomial  in  a  strictly  analogous  man- 
ner.     The  rth  terra  is 


29. 

./  -     K  fffa-fJ  fa-r+z) 

y     ^      1  \   £  /■  /  

Vr+i     n  — — L —  r  " 

slcr-  -  yr+ry? 

c 

Mean  ordinate  =     -g-  ( y  ^     y^  / 

Ratio  -    J2±'    'r          .        y  "p 

Let            =  V 

Mean  ordinate  =  y 

Slone  =  &y 

y  dx    ^    &  fa-r+/Jv+r 

(a) 

Let  mean  absicssa  =    X  -  +Xf+/J 

ZX  =    gey  +c 

K   -     X  / 

Substituting  this  value  of  r  in  (a) 

Replacing  v  by    q/p    and  remembering    p-Z-q  -  1 

30. 


By  moving  y-axis  by  means  of  the  linear  substitution  x' 
x-c(l/2-/-q£ri -f-lj)     and  dropping  primes 


Let  K 


and  L 


Inte grating 

-K*  +  *  L  /o9e  (x-t-L) -h/ope  y 


B 

■       v.  <s 

(x+i-y 


y  -    y  e 


But  for  the  problem  solved    p  =     .601,  q  =  .399,  n  =  2.632, 
c  =  4.2S7.       Substituting  these  values  in  the  expressions  for 
K  and  L  given  above  , 

K  =  2.310  L  =  36.966  KL  =  85.391 

Hence  everything  is  determined  but  yQ  which  being  a  constant  of 
integration  must  be  determined  by  substituting  simultaneous 
values  of  x  and  y.       Such  simultaneous  values  referred  to  the 
original  axis  (  1/2  c    to  the  left  of  the  initial  point  of  the 
first  rectangle  of  base  'c  )  and  found  on  pp.  2.4?  and  Zf* 
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J- 

=  c 

=  4.2S7 

yl 

=  4993 

X2 

=  2c 

=  8.574 

y2 

=  8723 

x3 

=  3c 

=  12.851 

y3 

-  4726 

x4 

=  4c 

=  17.148 

y4 

DOl 

But  by  change  of  axis  ,  page  , 

x'  =  x  -  c  [l/2  +  1)] 
Substituting  the  values  of  c,  q,  and  n  shov/s  the  axis  shifted 
to  the  right    6.3561    andhence  falling  at  the  ago  10.0925. 
The  values  of  x  then  become 

xx  ~  -4.069 

x9  =  .218 
=  4.505 

o 

x4  =  8.792 

As  a  check  against  errors,  log.  yQ  was  computed  three  times 
by  substituting  the  first  three  pairs  of  simultaneous  values 
of  x  and  y  in    log  y 

l0se  y0=  l0ee  y  t  kx  -  kl  ioge  (xfL) 

Whence 

lo610  y0  =  los10  y  +  —   "  ^  loSlO  ^x  **~  L^ 

2.3026 

=  log10  y  +  1.0032  x    -    KL  log10  (x  +-  L) 
The  three  values  of  log10  y      so  found  are    -129.936,  -129.940, 
and  -129.946.      Using  the  average  of  these  three  gives 
y  =  log-l(-129.941)(x-f-  36.966)85-391e-2-31  x 
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Computing  simultaneous  values  of  x  and  y  by  this  equation  gives 


x 


•  6 
■5 
•4 

■3 
•2 

•  1 

0 
1 

2 
3 
4 

5 
6 
7 
8 
9 


Total 
Observed 

Difference 


Computed 

y 

2431 
3  647 
5C03 
6403 
7570 
8337 
8395 
8556 
7489 
6598 
5410 
4207 
5121 
8209 
1524 
970 


83865 
81743 


Average 
observed 

y 

985 
3433 
6307 
7646 
7859 
7  924 
7857 
7666 
73  62 
703  0 
6164 
4633 
5050 
1858 
1080 


Difference 


1446 
214 

1504 
1245 
289 
415 
558 
690 
127 
452 
754 
426 
71 
351 
444 
389 


Average  = 


error 


146.8 
6.2 
20.7 
16.3 
3  .7 
5  .2 
6.8 
9.0 
1.7 
6.1 
12.2 
9.2 
2.3 
18.9 
40.8 
66.9 


tr><7    v  ef 


2122  =2.5  %  of  whole 


Average  omitting  end 
terrs      =      11 .4  % 


The  x's  here  tabulated  are  measured  from  an  origin 
at  the  age    10.0925,  and  hence  fall  within  accuracy  of  plot- 
ting at  the  integral  ages    4,  5,  -  -  -  19.      The  curve  is 
continuous  but  the  plotted  data  consisted  of  isolated  points. 
If  the  number  at  any  age  as  4  be  taken  as  one-half  the  sum 
of  those  included  in  the  year- intervals  immediately  preceding 
and  following  there  will  be  a  possibility  of  comparing  the  or- 
dinates  of  the  probability  curve    ith  those  of  the  observation 
polygon.      The  results  L.re  given  in  the  table  above.      The  fit 


is  not  good  and  is  the  poorer  e»s  one  recedes  from  the  centroid- 
al  axis.      The  fact  that  the  per  cent  error  including  the  end 
terras  including  the  end  terms  is  23.3  and  excluding  these  terns 
is  11.4  emphasizes  the  source  of  error.      This  error  stated  in 
brief  was  in  the  failure  of  quadrature  formulas.      When  the  area 
of  a  figure  is  considered  as  concentrated  at  its  centroid,  the 
product  of  the  area  and  the  distance  of  this  centrcid  from  the 
axis  of  moments  gives  the  first  moment  exactly,  for  the  cen- 
trcid is  defined  as  the  point  for  which 


But  when  the  second  moment,  is  computed  with  the  square  of  this 
same  distance  as  a  multiplier  for  the  area  this  assumes 

x  =  radius  of  gyration 


which  is  not  true.      And  in  the  higher  moments  this  same  sort 
of  error  enters  to  a  still  larger  extent  ,  for  an  error  in  the 
x  affects  the  result  the  more  seriously  as  the  x  is  raised  to 
a  higher  power.      That  this  is  true  and  that  the  error  will  be 
greater  in  terms  farthest  from  the  axis  of  moments  may  be 


or 


shown  by 


=  4x3x'  +  6x2(x'  )2-/-4x(x'  )5-f-(x'  )4 


where  x  is  the  true  value  and  x'  the  error. 

This  problem  will  perhaps  be  useful  as  an  illustra- 
tion of  one  of  the  type  curves  used  by  Prof.  Pearson,  full 
treatment  of  which  may  be  found  in  Philosophical  Transactions 
of  the  Royal  Society,  Volume  186A,  part  1,  pages  343  -  414. 
Prof.  Pearson  seems  to  have  fairly  well  settled  the  question 
as  to  the  relative  merits  of  the  method  of  moments  and  that  of 
least  squares  for  statistic?-l  problens  ,  and  has  shown  conclu- 
sively that  his  various  types  of  curves  derived  from  the  point 
binomial  may  be  fitted  by  the  method  of  moments  to  large  class- 
es of  observa.tion  curves  where  the  method  of  least  squares 
has  nothing  better  or  more  flexible  to  offer  than  the  normal 
frequency  curve. 
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QUADRATURE    FORMULAS . 


Since  the  failure  to  obtain  an  accurately  fitting 
curve  in  the  problem  solved  is  manifestly  due  to  lack  of  qua- 
drature formulas,  a  section  of  this  paper  will  "be  devoted  to 
discussing  some  common  formulas  and  their  suitability  to  the 
matter  in  hand.      The  treatment  is  in  terms  of  the  calculus 
of  finite  differences,  and  the  notation  used  is  as  follows: 

w  ,  w      „  .  -  -  =  any  function  of  x ,  the  same  function 
x      x  4- 1 

of     x  4-1  ,   of    x  4- 1  ,  -  -  - 

Dw    =  the  increment  in  the  function  due  to  a  unit  in- 
x 

crease  in  x    =  w        -w  . 

x  4-1  x 

D^wx  =  result  of  repeating  operation  denoted  by  D 
=  D(Dwx)  =  Dwx  +  1-Dwx 

=  Wx  4-  2-^X4-  l"fWX  4-l-WxI- 
To  illustrate ,  let  w    =  x  . 

Taking  x  =  1 .  2  ,  -  -  -,  S 


:  4- 1  ' 

V'x4-7  ~ 

1 

8 

27 

64 

125 

216 

343 

Dwx  , 

Dwx  +  1'  " 

X  **"  U 

7 

19 

37 

61 

91 

127 

169 

D  wx, 

D  v'x4-l>  " 

D"x+5  = 

12 

IS 

24 

30 

36 

42 

D3wx  , 

^x  +  4  = 

6 

6 

6 

6 

6 

D4w  , 
x  ' 

DS+i»  - 

^vx  4  3  = 

0 

0 

0 

0 
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The  abbreviated  notation  D    is  used  sometimes  for  D 


By  definition , 

r       t»-1  r-1 


r-2      r-2  r-2  r**2 


=  Dn  4-2Dn+1+Dn+2 
r-2        r-2  r-2 


=  Dn    +  3Dn+l+  3^4-2^.  Dn+3 
r-o        r-3  r-3 


0        0  g, 

as  may  be  easily  shown  by  induction  or  seen  by  inspection 
since  the  coefficients  are  formed  by  the  same  succession  of 
additions  as  those  in  finding  the  nth  power  of  a  binomial. 
Also  by  definition 


wx+l     =  Wx  +  D0 


wx*2    =  wx+l+Dl 


v/x+3    =  V2+D2 


Hence  from  the  way  in  which  the  coefficients  result  from  suc- 
cessive additions 

w        a  w    +rD0  +  r  ( r-1 )       4*  +  vF 

x+r        x        u  T — gT~      0  0 

=  wv  -*-rDw„  fr(T-l)p\,    -  -  -  ^-Drwv 
x  — 57-        x  x 

If  x  =  0    and    r  =  x 

?  5 

W-  =  Wq  4-xD\7q  4-X(x~1)D  Wq  4-x(x-1)(x-3)D  Wq 

2  •  3  • 

a  form  which  has  "been  called  the  conjugate  of  Iviaclaurin'  s 
the  orem . 

Suppose  then  the  problem  given  is  to  compute  the 
area  under  a  curve  from    n  -hi    equidistant  ordinates. 

Let  it  be  assumed  as  in  the  original  development  of 

the  method  of  moments  that  there  exists  an  equation 

w    =  0 
x 

which  exactly  fits  the  curve  in  question  and  whose  expansion 
"by  Taylor's  theorem  converges  fairly  rapidly.      As  will  be 
shown  in  the  section  on  the  choice  of  form  of  function  to  fit 
given  data,  this/i.s'  equivalent  to  saying  that  the  successive 
orders  of  differences  diminish  rapidly. 

Let  the  y-axis  coincide  with  the  first  ordinate. 
Then  the  successive  ordinates  are    Wq,  w^ ,  Wo,  -  -  -  w 

By  the  first  part  of  this  paragraph , 
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wx  =  wq  +xDvy0  +x  ( x-1 )  p2y/o  4-  x  ( x-l )  ( x-2 )  DawQ  -t- 


o  » 


and  the  area  und^r  the  curve  is 


rn 

w  dx  = 
0  x 


dx  -t-Dw 


iofn^3- 

I  /0 


h 

X  dx  -f-D 


3 

t4?o 


0 

3  ,„2 


v/      ( x^-} 

r°/o 


x^-x)dx 


3x^f-2x)dx  +■  D4wn     (x4-6x3  -f-llx2-Gx)dx 
4'.  U/0 


(x5-10f-35x-50x2  +  24x)dx 


0 


,  D6v/  I   (x6-15x5  +35x4-225x3+-274x?'-120x)dx +  -  - 
^TT0/  0 

-  nwn  +nf  Dv/  +  ( n^-n2 )  D^wn  f  ( n  -n3  +  n2  )D°wn 
.R  .  ,,_J5  iJgv-A      .  /    6  «._5  .  „m  4 


K  9 


+  ( n^-3n*  -h  lln^-5n~  )P*v/   +  (n6-2ri5+  3J5n*-50n° 
3  41°      6  4  3 


+  12n3)D5.7n 
5'. 


( n7-5n6  +  17n5-235n4  +  274n5-60n2 )  lArn  +  


-0 

7     2  4  3  61 

If  only  2  ordinates  are  given  the  area  is  a  trapezoid 

and  the  area  :.vould  ordinarily  be  computed  by  Wq  +-  w-  .  Applying 

2 

this  general  formula  gives  since  n  =  1    .  and  D2w    -  0 

0 


n 


0 


W    dx  =  w  -f  1/2  Dv/_ 
x  0  0 


wo*wi 


2 
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For  a  series  of  adjacent  trapezoids  each  ordinate  except  the 
first  and  last  'must  be  counted  in  two  trapezoids  and  hence 

Area  =  1/2  (wQ  -l-w-  )  +      -*-Wg  -t-w3  -t-  -  -  -  *~w£_i 

whore  ?  -hi  is  the  number  of  ordinates.  If  the  ordinates  are  a 
at  distances  h  instead  of  unity 

I      Jz  dx    =  hj~zQ  +-z2  "**2(z1  -h  zo  +-  z„-h  -  -  ~  +z  )J 

p  n 

where  x  represents  successively  y,  xy,  x^y,  -  -  -    x    y  and 

z    is  the  value  of  z  when    x  =  rh. 
r 

rr 

If  only  three  ordinates  are  known    n  =  2    and    D  Wq=0 
and  hence  the  area  is 


ijyr-  2w0  -h  2Dwn  -f-  (  8-2 )  D2w, 
0  X  U      3  21 

=  2wQ  -K2DwQ  lD2wQ 
3 


SWq  +  2  ( -w0  )-#-!(  ^2""2wl    w0  ^ 


3 


This  is  of  course  equivalent  to  assuming  a  second  degree  parabo- 
la passed  through  the  three  points  and  finding  thei  area  under 
this  parabola.    Itis  the  common  form  of  Simpson's  rule,  and  is 
readily  extended  to  any  figure  where  an  odd  number  of  ordinates 
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ar  =  given,    for  there  are  then  an  even  number  of  elementary 
areas  and  if  they  be  grouped  in  2's  the  total  area  will  count 
the  first  and  last  ordinates  once  each,  the  second,  fourth, 
sixth,  etc.  four  times  each  as  a  nid- ordinate  ,  and  the  third, 
fifth,  seventh,  etc.  each  twice,  once  as  a  last  ordinate  in  the 
preceding  area,  once  as  initial  ordinate  of  the  succeeding  area. 
If  the  ordinates  are  spaced  at  a  distance  h  from  each  other  in- 
stead of  at  unit  distance,  Simpson's  rule  becomes,  for    2p  +1 
ordinates , 

II  dx  =  hJz0*-Zgp  Y-4(z1+-z3  +-z5+  fz2p-l^2(z2fz4+-f'-op_o) 

It  is,  in  general,  a  better  form  for  quadrature  than  the  assump- 
tion that  the  area  is  a  series  of  trapezoids  just  as  a  smooth 
curve  through  a  set  of  points  is  a  better  representation  of 

their  lav/  than  the  straight  line  polygon  which  they  determine. 

4 

If  four  ordinates  are  given    n  =  3    and    D  i?q  =  0. 
The  area  then  becomes 

\idx=  3w0t  9Dw0+  (9-9)D2wQ+  (81  -27*  9)p3w 
0  X  2  2  2*.  4  3  0 

=  3wQ  +-  9(  w.-Wq ) f  9(  Wp-2w1  +  w0 )+  3  ( Wg-SWg  +■  Sw^Sq ) 
2  4  8 

-  3wn  +-  9w.  +-  9w0+  3w„    =  3jwn-H3wn  4-3wo  +■  w,  ) 
This  result  is  Newton's  rule,  is  perfectly  accurate  for  third 
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degree  parabolas  and  is  in  general  for  any  ordinary  function  a 
closer  approximation  than  Simpson's  rule.      It  has  the  disadvan- 
tage of  applying  only  to  those  cases  •There  the  number  of  ele- 
mentary areas  is  a  multiple  of  3.       V/ith  the  same  notation  as 
used  in  generalizing  Simpson's  Rule,  this  gives  for  a  number  of 
elementary  areas  bounded  by    3p+-l     ordinates  and  placed  at 
equal  distances  h, 


III    z  dx  - 


4-^z1-+.z2+.z4  +  z5t  z7+-z0  -  -  z 


If  five  ordinates  are  given,  n  =  4    and  D  w0  =  0. 


Then  the  area 


0 


wx  dx  =  4wQ  t-  8DWq  4»(  64-8)D -Wq  f-(  64-64  #I6)D  w, 
1  3      2'.  5 '. 


0 


t  ( 1024-3 84+ 704-43 )Dfw 


5  3  4'. 


=  4w0  +  3 ( w, -& Jf  20( Wo-2w-  f  wn )  +  3 ( w„ -oyi 


14(w4-4v/3-4-5w2-4v;14-v/0) 

45 


2 
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Generalizing  as  before  for  4p -hi  ordinates  at  equal  distances 
h. 
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IV    jz  dx  =  |h[7(z0-f-z4p)+32(z1+.z3^z5+z7+-  - 

*~12(zg*  zG  fz1Q4-  Z4p-o) 

I  14(z4+z8+-z10-f  +  z4p-4>] 

The  form  taken  by  this  formula  when    n  =  6    is  especi- 
ally good  for  data  like  those  fitted  by  a  sine  curve,  where  no 
order  of  differences  vanishes  but  they  do  diminish  in  size  ra- 
ther rapidly. 

w£WP=  6w0-M8Dw0-f-  (72-18)D'^yQ  4-( 524-216    56 )D°wQ 
0  2 1  3 • 

+ (7776-1944  +  792-108)Dj7n 
5  41 

5. 


+  (7776-15552  1 11540-5  600  -f  452) D°w0 


5'. 


t  (27  9956-116640  +-1321 92-72900  -f- 1972  3-21 60  )D6wQ 
7  6'. 

=  6v/  -t-  13DwA  +  27D2wn  +-  24D3wn  +  123D4w.    33D5w.  +  41  D°w0 
0  0  0  U    To"      0    10      0  140 

Since  D  Wq  is  snail  a  very  slight  error  is  introduced,  while  the 

labor  in  application  is  much  reduced,  by  substituting  for  the 

last  coefficient    42  =  __3  .      The  formula  then  becomes 

140  10 

Area  =  3  *"W2         ^"WG     5  ^  wl  +  v/5  ^  +  **w5  J 

7 

It  is  a  peculiar  fact  that  the  coefficient  of  D  w    vanishes  i- 
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dentically,  for  thequantity  to  be  integrated  (page  38)  is 

x(x-l)  (x-6)pV 

0  71  U 

D7v/0  |(x7-21x6  -f-175x5-735x4+-  1624x3-17  64x2  +■  720x) 


71 


'O 


-  D 

7 


_w  rn8~3n7-f-  175n6-147n54-  406n4-588n3+  3  60n2)  _6 

:  °[3  6  J: 


Putting  6  for  n  the  torn  reduces  to  zero.  Then 


o 

10 


[Wn  fW0  +  W  ■*-  W  +•  5  (  V7-  +  W„  )  +  8w~  I 
.  u      a      4      g         J-      5  «jj 


gives  the  area  exactly  if  the  fifth  order  of  differences  is  con- 
stant  and  with  an  error  of  only    1  D  w      when  either  sixth  or 


0 


140 

seventh  order  of  differences  is  constant. 

Generalising  for    6p  +1     ordinates  at  equal  distances  h 

V        fz  dx  =  3|[z0+-z2-hz4+-z6  +  zsf-z10+  ^z6p 

-h  5(Zl+z5  +  z7tzu*  ^z6p-l) 

f  6(z3+  z9  +  z15+  +Z6p-3)"I"2(Z64*Z124*-  -  ^zCp-G)] 

This  formula  is  called  We&dm's  Formula  and  is  of  much  use  and 
groat  accuracy  in  quadratures  where  there  is  really  a  mathemati- 
cal law  expressed  in  the  data  as  in  case  there  actually  is  a 
sine  curve  whose    constants  are  to  be  determined  by  a  few  iso- 
lated points.      But  manifestly  in  fitting  a  set  of  very  irregu- 
larly placed  points  which  represent  for  instance  a  statistical 
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problem,  where  there  is  no  reason  to  expect  continuity  and  where 
the  irregularity  may  bo  due  to  the  limited  number of  observations, 
this  formula,  which  in  effect  assumes  a  sixth  degree  parabola 
through  each  group  of  seven  points  is  no  more  likely  to  be  true 
than  the  assumption  of  a  succession  of  trapezoids.      All  the 
formulas  of  this  article  are  subject  to  the  disadvantage  that 
they  give  to  certain  ordinate s  and  hence  to  certain  observations 
greater  weight  in  the  determination  of  the  final  result  than 
to  others.      Thus  all  of  them  make  the  end  points  less  important 
in  determining  the  curve,  and  the  result  is  that  the  end  points 
are  not  so  closely  fitted  as  the  others. 
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THE  PARABOLIC  CURVE. 


y  =  a  +■  a^x  -^a^x  +>  a^x  -f*  -  -  -  f  anxn 

has  been  so  universally  accepted  as  a  suitable  function  to 
represent  most  engineering  and.  many  other  formulas  because  of 
its  simplicity  and  because  of  the  readiness  with  which  the  me- 
thod of  least  squares  can  be  applied  to  it.       It  is  not  the 
purpose  of  this  paper  to  advocate  the  indiscriminate  use  of 
this  function,  which  the  writer  thinks  has  been  much  used 
where  other  functions  would  answer  the  purpose  better,  but  it 
can  be  shown  that  even  here  where  the  method  of  least  squares 
is  best  adapted  to  the  end  sought ,  the  method  of  moments  is 
better.       It  will  be  remembered  that  theoretically  one  is  as 
good  as  the  other  because  the  method  of  moments  for  functions 
of  this  form  is  not  an  approximation  but  is  absolutely  accurat 
The  advantage  lies  in  the  fact  that  much  of  the  work  can  be 
done  once  for  all  and  the  results  appliedwith  a  reduction  of 
the  labor  involved  to  the  solution  of  the  individual  problem 
in  hand. 

Let  the  assumed  function  be 


46 


y -/.[*.  +  *.  (f)  (f) "  -'j 

where  2  /  is  the  total  range  of  observations  along  the  axis 
of  x. 

A  =  area  =  2  /  yQ 


Assume  the  origin  at  the  middle  of  the  range  ,  then  for  even 
mome.it  s  _ 

m  zty  +  +  ^   7 


for  substituting  the  limits  /  and  -/  make  even  powers  of  x 
cancel  and  give  twice  those  terms  containing  odd  powers. 

Z2r  LZr+j     zr+3  r  gr+s  J 

t        flSf  -    <f>°   +  -h  -h  

?*r   ~  2r+/     ?r+3  gr+g 

For  odd  moments  similar  reasoning  gives 

-  **ty*  fjfo  ^      +  -] 

tt  1  «.  ^u.  +  +  
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s 

-  m3 

'  f  ' 

TVi  fl  ti   'Pi*  nm  T 

u 

=  ;I0  = 

r 

i    =  1  =  ca    e0  +■  qa.  +-  e g  +  *  -  - 

1                        3       5  7 

Lo 

—   I^r>  — 
J* 

en    So  .  e.     q  r  j.  —  —  — 

3       5       7'  9 

4 

_  in  A  _ 
—     «±  — 

?* 

5       7       9*  11 

From  II 

1 

=  -1  = 

Z 

e,  j-  e-  4.  e   +  en  +  ~  -  - 
—1 r  —3  ^  — r  '  —7 

3       5       7°  9 

T 

3 

=  »"3  = 

e-.  f  g„  f  er  f  G7  +  -  -  - 

5      7°     9  11 

Then  for 

a  second  order  parabola, 

u 

3 

l 

=  ^1 

3 

T, 

=  ©0  + 

§9 

<•> 

Solving  for  the 

e's 

III 

=  3(3L 
4 

0  ~  5L2^ 

-3Lj 

1  6? 

=  15(3 
4 

L2  "  V 
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By  the  use  of  III  the  constants  may  be  computed  di- 
rectly as  soon  as  the  area  and  first  two  moments  of  the  obser 
vat ion  curve  have  been  determined. 

For  the  third  degree  parabola, 


L0  =  G0  t|2 

6 


Ll  =  £  i  +  £3 

3"  5 


L0  =  ®0  +  ®2 

3  5 


6      5  7 


IV 


Waence 


en  =s  3(3Ln  -  0Lo) 
U      4  0 

e-,  a  15(5L_   -  7L*) 
1        4  1 

eQ  =  15(3L2  -  LQ) 
4 

e    =  35(5L*  -  3L-, ) 
3        4  0 

For  the  fourth  degree  parabola 

y  -  y0feo+eif  Wf)  WfJ  ^94/fJ  J 


0  ^0'~^T^4 

o  o 


L 


1       3  5 
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L2  "  £0+-P+®4 


3      5  7 


=  e  +  e„ 
5-1  7 


4      5      7  9 


Whence 


en  s  i§(15Ln  -  70L9  +  63L  ) 
u       64        U  4 


e.   =  15(5L    -  7L~) 
1      ~4  1 


V        e     =  105(-5Ln  +  42LQ  -  45L  ) 
2      "32        °  2  4 


8     =  33(5L     -  3L-.) 
3         J  o 


e4  =  315 (3LQ  "*  30Lo  +  35L4) 
64 


For  the  fifth  degree  parabola, 

? 


y 


e„  =  15(15Ln  -  70Lo  +  63LA) 
0       64        °  2  4 


e     a  105(53Ln   -  126L„  f  99L  ) 
1      ""5o        1              3  5 

3ZI 

ert  =  105(-  5Ln  -f  42L0  -  45L  ) 
2—0            2  4 


e-  =  315 (-  21L-,   +  90L„  -  77L-) 
3      ~64  1 
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e.  =  315(3Lo  -  30Lo  +  35L. ) 
64  *  * 

e     =  693(15Ln   -  70L~  4-63L,-) 
For  the  sixth  degree  parabola 


e0  =    35(35LQ  -  315L    +  3  93L,  -  429L-) 
256 

9,  =  105 (35L    -  126L5  +  99L5) 
64  1 

e0  =  315  (-  35LQ  f-  567Lp     -  1485L4  +  lOOlLg) 
256 

VII     q„  =  315  (-  21L-1  +  90L„  -  77L-) 


e    a  5465 (7Ln  -  135LQ  +  3S5L    -  275LA) 

4  256      °  *  4 

e    =  693(15Ln   -  70L,,  f-  63L-) 

5  "II        1  3  5 

e-  =  3003(-  5Ln  +  105Lo  -  315L„  +  231LA) 
o      — :  U  2  4  ° 

25  6 


Illustrative  problem  on  velocities  of  the  Mississippi 
River  at  different  depths.  Let  it  be  required  to  fit  a  second 
degree  parabola  to  the  following  data. 


X 

y 

X 

y 

X 

y 

0 

3.195  0 

.3 

3.2611 

.7 

3  .1266 

.1 

3.2299 

.4 

3 .2516 

.8 

3  .05  94 

o 

3.2532 

.5 

3.2282 

.9 

2.975  9 

.6 

3.1807 

where  x  =  the  fraction  of  the  depth  at  which  the  observations 
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were  taken. 


y  =  observed  velocity  in  feet  per  second. 


Using  Newton' 


s  Rule  for  quadrature 


the  work  is  tabulated  as  follows.      Columns  narked  x  and  y  con- 

o 

tain  the  data,  those  marked    xy    and  x^y    products  of  corres- 
ponding values  of  the  variables.      The  column    S(y)  contains 
the  successive  terms  of  the  y  column  multiplied  by  thenumerical 
factor  which  is  assigned  to  it  by  Newton's  formula  above,  that 
is,  the  first  by  1,  next  two  by  3,  next  by  2,  etc.      Hence  the 
sum  of  this  column  multiplied  by  3/8  h    gives  the  total  area. 
The  column  S(xy)    and    S(x2y)     in  the  same  way  give  the  first 
and  second  moments. 


X 

y 

S(y) 

xy 

S(xy) 

xsy 

S(xSy) 

0 

3  .1950 

3.1950 

0 

0 

0 

0 

.1 

3  .2299 

9.6897 

» o  *3  2  Q  9 

.96897 

.032299 

.096897 

.2 

3.2532 

9.7596 

.65  054 

1 .95192 

.130123 

.3  90384 

.3 

3 .2611 

6.5222 

.97833 

1 .95  666 

.293499 

.586998 

.4 

3.2516 

9.7548 

1 .30064 

3.90192 

.52025  6 

1  .560758 

.5 

3.2282 

9.6846 

1.61410 

4.84230 

.807050 

2.421150 

.  6 

3.1807 

6.3  614 

1.90842 

3.81634 

1 .145052 

2.290104 

.7 

3.1266 

9.3798 

2.18862 

6 .56536 

1  .532034 

4.596102 

.8 

3.0594 

9.1782 

2.44752 

7.34256 

1  .958016 

5 .874048 

.9 

2.9759 

2.9759 

2.67831 

2.67331 

2.410479 

2.41047  9 

7  6.5012 


34.02534 


20.226930 


AMQ  =  3h  x  76.5012 

8 


h 


10 


6 


x  76.5012  =  2.863795  =  A 


30 
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AMi  =    3  x  34.0254  =  1.275950 
80 

Ul  =  .444769 

mo  =  _3  x  20.226930  =  .7585099 
2  80 

Mg  =  .264400 

But  these  Li's  are  calculated  about  a  y-axis  at  one 
end  of  the  range,  and  in  order  to  apply  formulas  V,  the  M's 
about  an  axis  through  the  mid-point  of  the  range  are  required. 
These  might  of  courso  have  been  calculated  originally  but  since 
the  mid-point  comes  at    x  =  .45  each  x  would  have  been  ex- 
pressed in  tvro  figures  and  the  arithmetical  work  thereby  in- 
creased.     The  axis  of  moments  must  now  be  shifted  to  this 
mid-point,  a  distance  of  .45.      Hence  by  the  formula 

M'  =  M„  -  dnl  .L,         dgn(n-l)Ll    0  +  

derived  on  page 

%  =  Lio  =  1 

Ml  =        -   .45  =  -.005231 

M*  =  M0  -   .m,  +  .2  025  =  .066608 
2        2  1 

2  1=  the  range  =  .9 
1  a  .45 


Curve  showma  relafion  of 

Velocity  and  Depth 

in 

fr?/ ssi  ssi  ppi  Rive  r. 
Method  of  Moments. 


o 


Fra ct/on  of  total  depth.  Plate  II 

,x     .3      4     S     U      7     .g      .9  jo 
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v  a  ^ —  sb  3.187550 
^0      2  1 


L  =#0  =  1 
^0       1  X 


Ll   =  #1  =  .01L624 


1 


1 


LQ  =  Sfs  .328928 
2  x« 

e0  ~  3(3L0  -  5L0)  *  1.016520 
4 

ex  =  3L  =  -.034872  By  formula  HL 


e„  =  15(3L2  -  L0)  =  -.049560 
4 


Hence  the  equation  of  the  curve  referred  to  an  axis  through 
the  middle  of  the  range  is 

y  =  3.187550(1.016520  -   .034872  x  -   .049560  x2) 

.45  .2025 

Transferring  back  to  original  axis  In  order  to  have  a  more  con- 
venient form  of  equation  by  using  the  substitution 

x  =  x»  -  .45 

y  =  3.18755(1.001832  +  .142774  x  -   .244741  x2) 

-  3.193389  +  .455099  x  -   .780123  x2 

The  same  data  will  now  be  fitted  by  a  function  of  the 
same  form  by  the  method  of  least  squares  and  the  amount  of 
work  involved  in  the  two  methods  compared. 


1/ 


Observation  equations 
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Assume    y  =  a  +•  Tax  +  cx~ 
Substituting  simultaneous  values  of  x  and  y 

3 .1950  =  a  *  0.  b  +  0.  c 

3.2299  =  a  +     .lb    +  .01c 

3.2532  =  a  +     .2b  +  .04c 

3.2611  -at     .3b  +  ,09c 

3 .2516  =  a  +     .4b  t  .16c 

3.2282  =  a  +      .5  b*  .25c 

5 .1307  =  a  +     .6b  *  .36c 

3.1266  =  a  +     ,7b  +  .49c 

3  . 05  94  =  a  -t     .  8b  +     .  64c 

2.975  9  -  ajr      .9b  -V  .81c 
I      31.7616  =10a+ 4.5b  +  2.85c 

I  is  the  first  normal  equation  for  all  the  coefficients  of  a 
are  unity. 

Multiplying  each  equation  through  by  the  coeffici- 
ent of  b  and  forming  second  normal  equation  gives 


.32299  = 

.la  + 

.01b 

+ 

.001c 

.65064  = 

.2a  + 

.04b 

,008c 

.97833  = 

.3a  + 

.09b 

+ 

.027c 

1.30064  = 

.4a  + 

.16b 

.064c 

1.61410  = 

.5a  +■ 

.25b 

.125c 

1.90842  = 

.6a  -h 

.36b 

.216c 

2.18862  = 

.7a  + 

.49b  -* 

.343  c 

DO  . 

2 .44752 

=     .8a  t 

.64b   +  .512c 

8.67831 

=    .  9a  + 

,81b  4-  .729c 

11        14 . Uodo 1 

*  4.5a*   2.85b*  2.025c 

Multiplying  through  by  coefficients  of  c 

and  forming  the  third 

normal  equation  gives 

.032299 

=     .01a  + 

,001b   +  ,0001c 

.130128 

=     .04a  t 

,008b  *  ,0026c 

.293499 

=  .09a 

,027b  +  ,0081c 

.520256 

=     ,16a  + 

,064b  +  ,0256c 

.807050 

=     ,25a  + 

,125b  +  .3125c 

1.145052 

=  •   .  3  6a  + 

,216b  +  ,1296c 

1.532034 

=     ,49a  + 

,343b  4  ,2301c 

1 .058016 

=     ,64a  -i- 

,512b  #  ,4096c 

2.410479 

=     ,81a  +• 

,729b  +     ,65  51c 

III  8.828813 

-  2.85a  + 

2.025b  +  1.5333c 

The 

throe  equations 

I  31.7616 

=  10  a 

+•  4.5     b  +•  2.85 

c 

II  14.08957 

=    4.5  a 

f  2.85  b  +  2.025 

c 

III  8.828812 

>  =a    2.85a  +  2.925b  +  1.5333c 

must  be  solved  for  a,  b 

and  c . 

I  28.58544 

=  9a  +■  4.05b  +  2.565c 

II  28.17914 

=  9a  f  5 .70b  +  4.050c 

IV        - .4063 

1. 

65b  1.485c 

C u rve  show i ncj  relation  of 

Velocity  and  Depth 

In 


Missis  sip  pi  River. 


Fraction  of  total  depfh.    Plate  III. 
o       j      .x      4       *       s~      .c       7       s       $  to 


II  26.436439  =  8.55a  +■  6.075b  +  4.5  999c 

III  26.770165  =  8.55a  £  j5 .416b  4»  5.8475c 
V  -.233744  ~  .660b  -f  ,7524c 

IV  -   .8126      =  3.3b  -V  2.97c 

V  -1.41872     =  3.3b  *  5.762c 

VI  -   .60612    =  .792c 

c    =  -  .7655 

IV  -   .4063  a  1.65b  -  1.13  64715 

b  =  .44253 

I  31.7616  a  10a  +  1.991335  -  2.181105 

a  a  3.19513 

Hence  the  equation  is 

y  a  3.19513  *   .44253  x  -   .7653  x2 

All  the  work  for  solving  this  problem  by  both  methods 
has  been  given  in  order  that  the  amount  of  labor  involved 
might  be  compared  in  a  problem  where  all  the  possible  advanta- 
ges were  on  the  side  of  the  least  squares  method.      Here  it  was 
required  to  transfer  the  axes  in  the  final  result  to  a  certain 
position,  namely,  with  the  origin  on  the  surface  of  the  water. 
This  was  not  a  necessity  inherent  in  the  problem  of  curve-fit- 
ting but  was  nocessary  for  convenience  in  using  the  formula 
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and 

for  con1. 

parison  of  results  of  the  two 

methods 

It  \ 

vill  be 

see 

n  that  even  with  this  additional  labor  the  method  of 

moments 

is 

much  the 

less  laborious  method.  It 

remains 

to  tabulate  the 

relative  accuracy  of  the  results. 

Observed 

Least  squares 

Lioments 

o 

X 

y 

y          v  v° 

y 

V 

d 

V 

0 

3.1950 

5.1951     -.0001  .000000 

3  .1934 

.0016 

.000003 

.1 

5.2299 

3.2317     -.0018  3 

3.2311 

-.0012 

1 

.2 

3.2532 

3.2530       .0002  0 

3 .1532 

0 

0 

.5 

3.2611 

3.25  90       .0021  4 

3  .25  97 

.0014 

2 

.4 

3.2516 

3.2497       .0019  4 

3 .2506 

.0010 

1 

.5 

3.2282 

3.2251       .0031  10 

3.225  9 

.0023 

5 

.6 

3.1807 

3.1851     -.0044  19 

3  .1856 

-.0049 

24 

.7 

3.1266 

3.1299    -.0033  11 

3  .1297 

-.0031 

10 

.8 

3  . 05  94 

3 . 05  94              0  0 

3  .0582 

.0012 

1 

.9 

2.975  9 

2.9735       .0024  6 

2.9711 

.0048 

23 

1.0 

2 . 8724 

2.8684 

.000057  .000070 


Using  the  ordinary  least  squares  test  for  accuracy  the  probable 
error  in  any  velocity  calculated  by  the  least  squares  method  is 


r  =  .6745/700057     a  .0019 
f  10  -  3 

The  probable  error  in  a  velocity  calculated  by  the  formula  de- 
rived by    method  of  moments  is 


r  =  .3745 J. 000070    =  .0021 
I  10-3 

The  difference  in  the  degree  of  accuracy  is  then  so  slight  as  to 
be  negligible  even  when  the  test  of  accuracy  is  the  one  derived 
for  least  squares  where  the  function  minimized  was  the  sum  of 
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the  squares  of  the  residuals  at  the  points  chosen  while  the 
function  minimized  in  the  method  of  moments  was  the  sum  of  the 
squares  of  all  the  residuals  all  along  the  curve.      It  seems 
then  evident  that  even  for  the  parabolic  curve  ,  where  the  me- 
thod of  least  squares  is  most  useful,  in  a  curve  where  the  fi- 
lial   form  required  must  be  referred  to  a  particular  a.xis  ,  and 
using  the  ordinary  least  squares  test  for  accuracy,  the  method  of 
of  moments  gives  results  sensibly  as  goqd  as  least  squares  with 
less  labor. 

When  the  problem  does  not  include  transfer  of  moments 
from  one  axis  to  another  the  work  is  much  less.      As  an  example 
let  it  be  required  to  fit  a  second  degree  parabola  to  the  fol- 
lowing data  which  represent  a  cooling  curve  for  wat^r  when  sur- 
rounded by  a  freezing  mixture.      The  x's  are  the  time  in  min- 
utes and  the  y's  are  observed  Centigrade  temperatures.  x's 
are  results  of  transferring  the  y-axd.3  to    x  ~  10  the  mid- 
point of  the  range.      Quadrature  was  by  Simpson's  rule. 

Jz  dx  =  lh£z0+  z2p  -l-  2(z2  +  z4+-  +  z2p_2)  +  4(z14-z3+  -  +  Zo-0_l)j 

(See  section  on  quadrature,  pages  3  5~~  i  ^ . 
For  this  problem    h  =  1 . 

The  work  is  tabulated  as  described  on  page  5"/,  except  that  the 
numerical  fa.ctors  are  1,  4  and  2  instead  of  1  ,  3  and  2.  It 
depends  of  course  on  the  number  of  observations  what  quadrature 
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formula  can  be  used. 
Assume 

2  7 

y =  yo[eo^eii  +  s2ff)  J 


X 

X1 

y 

S(y) 

x»y 

S(x'y) 

x'~y 

S(x'Sy) 

0 

-10 

18.0 

18.0 

-180.0 

-  130.0 

1800.0 

1800.0 

1 

-  9 

15.1 

60.4 

-155  .9 

-  543.6 

1223.1 

4892.4 

2 

-  8 

12.8 

25.6 

-102.4 

-  204.8 

819.2 

1633.4 

3 

-  7 

10.8 

43.2 

-  75.6 

-  302.4 

529.2 

2116.8 

4 

-  6 

9.1 

18.2 

-  54.6 

-  109.2 

327  .6 

655  .2 

5 

-  5 

7.7 

30.8 

-  38.5 

-  154.0 

192.5 

770.0 

6 

-  4 

6.4 

12.8 

-  25.6 

-  51.2 

102.4 

204.8 

7 

—  o 

5.0 

20.0 

-  15.0 

-  60.0 

45  .0 

180.0 

8 

o 

—  i^, 

3.7 

7.4 

-  7.4 

-  14.8 

14.8 

29.6 

9 

_  1 

2.5 

10.0 

-  2.5 

-  10.0 

2.5 

10.0 

10 

0 

1 .45 

9  9 

0 

0 

0 

0 

11 

1 

.5 

2.0 

.5 

2.0 

c 

2.0 

12 

2 

-  .3 

.  6 

.6 

1  .2 

-  1.2 

-  2.4 

13 

3 

-  .5 

-  2.0 

-     1 .5 

6.0 

-  4.5 

-  18.5 

14 

4 

-  .7 

-  1.4 

-  2.8 

5.6 

-  11.2 

-  22.4 

15 

5 

-  1.0 

-  4.0 

-  5.0 

-  20.0 

-  25.0 

-100.0 

16, 

6 

-  1.2 

-  2.4 

-  7.2 

-  14.4 

-  43.2 

-  86.4 

17 

7 

-  1.4- 

-  5.6 

-  9.8 

-  39.2 

-  63.6 

-274.4 

18 

8 

-  1.7 

-  3.4 

-  13.6 

-  27.2 

-108.8 

-217.6 

19 

9 

-  1.9 

-  7.6 

-  17.1 

-  68.4 

-153.9 

-615.6 

20 

10 

-  2.1 

-  2.1 

-  21.0 

-     21 .0 

-210.0 

-210.0 

222.2 

-1831.0 

10752.4 

Hence 

AMq  =  A  =      222.2  ~  74.07 
3 

AIL  =        -  1351 .0  =•  -610.33 

3 

AM„  -  10752.4  =  3584.13 


uu . 

=  1 

T, 

-  Ml     sa      -610.33               _    _  SQAH 

1           74.07  x  10 

L 

2 

3534-13          =  .4839 
Is          74.07  x  100 

e0 

u 

=  3(3Ln  -  5L0)  =  .4354 
4 

Sl 

—    OIj^                                —    —  .3  .4:  /<d 

=  15  ( 3Lo  -  L«)  =  1.694 

Us 

4 

yo 

A  74.07 

Hence  equation  is 

y  ' 

=  3.704(.4354  -   .2472  x       .01694  x2) 

=  1.61  -   .916  x  +■  .0627  x2 

The  following  table  shows  the  differences  between 

the  observed 

values 

• 

of  temperatures  and  those  which  would  follow  the  law  ex- 

pressed 

in  this  equation. 

Observed    Computed  Difference 

2 
v~ 

X 

x*             y                 y  v 

0 

-10            18.0           17.04  .96 

.9216 

1 

-  9            15.1            14.93  .17 

.0289 

o 

t-J 

-  8            12.8            12.95  -.15 

.0225 

3 

-  7            10.8            11.09  -.29 

.0841 

4 

-  6              9.1              9.36  -.26 

.067  6 

5 

-  5               7.7              7.76  -.06 

.003  6 

6 

-  4              6.4              6.28  .12 

.0144 

7 

-  3              5.0              4.92  .08 

.0064 

8 

-  2              3.7              3.69  .01 

.0001 
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\y  \J  O  w  J.  V  «_/  V^l 

H  OF*  TY>  1  t  ft 

w  Will  Mm  u 

nif  f  ST*fi  nr.n 

-A. 

v 
J 

y 

V2 

V 

9 

_  1 

2  5 

2  59 

- .  09 

.  0081 

10 

0 

1  .45 

1 . 61 

-.16 

.025  6 

11 

1 

.5 

76 

-  ^6 

067  S 

12 

9 

-  .3 

.03 

-  .33 

.1089 

13 

3 

-  .5 

-  .57 

.07 

.  0049 

14 

4 

-  =7 

-1.05 

.35 

.1225 

15 

5 

-1.0 

-1  .40 

.40 

.1600 

16 

6 

-1.2 

-1 .63 

.43 

.1849 

17 

7 

-1.4 

-1  .73 

.33 

.1089 

18 

8 

-1.7 

-1  .70 

0 

0 

19 

9 

-1.0 

-1.56 

.34 

.1156 

20 

10 

-2.1 

-1 .28 

.82 

.6724 

2.7236 


The  probable  error  in  any  computed  y  is  then 

r  =  .  5745V  2/7.H§  =  .26 
J  21  -  3 

The  inaccuracy  of  fit  at  the  end  points  will  be  noted.  This 
is  to  be  expected  as  a  result  of  all  the  quadrature  formulas 
giving  less  weight  to  the  end  points  andis  one  of  the  weak 
places  of  the  method.       Omitting  the  end  points  reduces  the 
probable  error  to  .17.      The  average  of  the  absolute  values  of 
y  is  4.94  of  which  the  probable  error  is  about    5  %. 
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EXPONENTIAL 


FUNCTION,. 


Next  in  order  of  usefulness  to  the  parabolic  function 
is  the  exponential  function 


This  is  most  easily  dealt  with  by  passing  to  logarithms, 


and  fitting  this  form  of  curve  to  logarithms  of  data.  f(x) 
may  of  course  be  any  function  of  x  and  the  form  to  be  assumed 
would  depend  on  tiie  nature  of  the  curve  obtained  by  plotting  a 
set  of  points  with  abscissae  x  and  ordinates    Y  =  log  y. 
If  the  form  of  this  curve  is  parabolic ,  then  the  method  deve- 
loped in  the  Last  section  will  give  a  curve  fitting 


y 


Y  =  log  y  =  f(x) 


Y  =  f(x) 


and  hence 


y  =  e 


f(x) 


Let  it  be  required  to  fit  a  curve  of  form 


y 


a  f-  bx  +•  cx 

e 
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to  the  data  on  the  cooling  of  water  discussed  in  the  last  pro- 
blem.     The  first  difficulty  is  that  there  is  no  logarithm  of 
a  negative  number.      Hence  the  first  step  is  a  transfer  of  axis 
moving  the  x-axis  down  three  units,  by  the  substitution 
y»  a  y  +  5.      The  y-axis  will  also  be  taken  in  the  middle  of  the 
range  by  the  substitution    xf  =  x  -  10.      In  the  following  ta- 
ble x  is  time  in  minutes,  y  observed  temperature  in  degrees 
Centigrade,  x'  and  y'  as  given  above,    Y  =  log  y'     and  other 
columns  are  for  quadrature.       Simpson's  rule  was  used  in  quadra- 
ture.      (See  pages   3S~if.)      h  =  1 
The  assumed  function  is 

Y  =  a  f  bx1  +•  ex'2  -  yQje0  +  e^'  +  eoff)  ] 


X 

y 

x' 

r 

Y 

S(Y) 

x»Y 

S(x'Y) 

xt2y 

S(x,2Y) 

0 

is 

.0 

-10 

21.0 

3 

.0445 

"  3  . 044-5 

-3  0.4450 

-  3  0.4450 

3  04.4500 

304.4500 

1 

15 

.1 

-  9 

18.1 

2 

.8959 

11 .5836 

-26.0631 

-104.2524 

234.5  679 

938.2716 

2 

12 

•  8 

-  8 

15.8 

2 

.7600 

5.5200 

-22.0800 

-  44.1600 

17  6.6400 

553.2800 

3 

10.8 

-  7 

13  .8 

p 

.6246 

10.4984 

-18.3722 

-  73.4888 

128.6054 

514.4216 

4 

9 

.1 

-  6 

1  P  1 

2 

.4932 

4.9864 

-14,95 92 

-  29.9184 

89.7552 

179.5104 

5 

7 

.7 

-  5 

10.7 

9 

.3702 

9.4808 

-11 .8510 

-  47.4040 

59.2550 

237 .0200 

6 

6 

4 

•  ** 

-  4 

9.4 

o 

.2407 

4.4814 

-  8.9628 

-  17.925  6 

35 .8512 

71  .7  024 

7 

5 

.0 

—  3 

8.0 

.0794 

8.3176 

-  6.2382 

-  24.9528 

18.7146 

74 . 85  84 

8 

3 

.7 

-  2 

6.7 

1 

.9021 

3 .8042 

-  3.8042 

-  7.6084 

7 .6084 

15 .2168 

9 

2 

.5 

-  1 

5  .5 

1 

.7047 

6.8188 

-  1.7047 

-  6.8188 

1 .7047 

6.8188 

10 

1 

.45 

0 

4.45 

1.4929 

2.9858 

0 

0 

0 

0 

11 

.5 

1 

3.5 

1 

.2527 

5.0108 

1  .2527 

5 .0103 

1 .2527 

5 .0108 

12 

.3 

2 

o  7 

.9932 

1 . 9864 

1 . 9864 

3 .9728 

3 .9728 

7 .9456 

13 

.5 

3 

2.5 

.91 63 

3 .6652 

2.7489 

10.9956 

8.2467 

32.9868 

14 

.7 

4 

2.3 

.8329 

1 .6658 

3  .3316 

6.6632 

13 .3264 

25.6528 

15 

-1 

.0 

5 

2.0 

.6932 

2.7728 

3 .4660 

13 .8640 

17.3300 

69.3200 

16 

-1 

.2 

6 

1.8 

.5878 

1 .1756 

3 .5268 

7 .0536 

21 .1608 

42  .3216 

17 

-1 

.4 

7 

1.6 

.4700 

1 .8800 

3 .2900 

13  .1600 

23 .03  00 

92  .1200 
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x      y      x»     y'          Y         S(Y)          x'y          S(x'y)  x,LY  S(x,sY) 

18  -1.7       8    1.3     .2623       .5246      2.0984      4.1968  16.7872  33.5744 

19-1.9       9     1.1     .0953       .3812         .8577       3.4208  7.7193  30.8772 

.20  -2.1     10       .9  -.1054    -  .1054    -1.054-0     -1  .0540  -10.5400  -10.5400 

90.4785                  -388.0282  3025.8192 

68.3376 

-319.6906 


AM    =  1  S(Y)         =        30.15  95  =  A 
0  * 


AM.^  3  I  S(x»Y)     a  -106.5635 


AM    =  1  S(x,<dY)  =  1008.6064 
2  3 


2  1  =  20 


1  a  10 


y     =  A      =  30.1595  =  1.50798 


0 


2  1  20 


L     =  1,1       a  -.353333 
1  1 


L9  =  M      =  .334424 
*  1 


e0  ~  2(3  L0  -  5Lo)  =  .995910 
4  a 

e1  -  3LX  =  1.059999 


eQ  =  15(3L0  -  L0)     =  .012270 

*        4      B  w 


65  . 

Substituting  in  the  assumed  function 

Y  =  1.50798  (.995  910  -   .1059999  x'  +  .0001227  x'2J 

a  1.5018  -   .1599  x1   +  .000185  x2 
w,  _    1.5013  -   .1599  x'  +-  .000135  x2 

y  =  y'  -  3 

1.5018  -   .1599  x'  +   .000185  x2  a 
y  =  e  -  3 

Th3  following  table  shows  the  observed  and  computed 
values  of  y1  with  the  differences  between  then. 


Ob- 

Com- 

Differ- 

0b- 

Com- 

Differ- 

served 

puted 

served 

puted 

ence 

V2 

X 

x' 

Y 

Y 

ence 

y* 

y' 

V 

0 

-10 

3 . 0445 

3.1191 

-.0746 

21  .0 

22 . 63 

-1 .63 

2.65  69 

1 

-  9 

2.8959 

2.0557 

-.0598 

18.1 

19.20 

-1 .10 

1.2100 

2 

-  8 

2.7600 

2.7  927 

-.0327 

15.8 

16.32 

-  .52 

.2704 

3 

-  7 

2.6246 

2.6300 

-.0054 

13.3 

13.88 

-  .08 

.0064 

4 

-  6 

2.4932 

2.4677 

.0255 

12.1 

11.80 

.30 

.0900 

5 

-  5 

2.3702 

2.3058 

.0644 

10.7 

10.34 

.3  6 

.1296 

6 

-  4 

2.2407 

2.1443 

.0964 

9.4 

8.54 

.86 

.7396 

7 

-  3 

2 . 07  94 

1,9831 

.0913 

8.0 

7.28 

.72 

.5184 

8 

o 

1 .9021 

1 .8225 

.0798 

6.7 

6.20 

.50 

.2500 

9 

-  1 

1 .7047 

1 .6613 

.0429 

5  .5 

5.29 

.23 

.0529 

10 

0 

1 .4929 

1.5018 

-.0039 

4.45 

4.49 

-  .04 

.0016 

11 

l 

1.2527 

1.3421 

-.0894 

3.5 

3  .84 

-  .34 

.1156 

12 

2 

.9932 

1.1828 

-.1896 

2.7 

3.26 

-  .56 

.313  6 

13 

3 

'  .9163 

1 .0238 

-.1075 

2.5 

2.78 

-  .28 

.0784 

14 

4 

.8329 

,8652 

- .0323 

2.3 

2.38 

-  .OS 

.0064 

15 

5 

.6932 

.7070 

-.0138 

2.0 

2.03 

-  .03 

.0009 

16 

6 

.5878 

.5492 

.0386 

1.8 

1  .73 

.07 

.0049 

17 

7 

.4700 

.3  917 

.0783 

1.6 

1.49 

.11 

.0121 

18 

8 

.2623 

.2346 

.0277 

1 .3 

1.27 

.03 

.0009 
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Ob-  Com-  Differ-  Ob-  Com-  Dif far- 
served  puted  ©nee  served  puted  enco 
x       x '           Y-            Y  y '  y 1  v  v2 

19  9         .0953  .0779  .0174  1.1  1.08  .02  .0004 

20  10      -.1054  -.0785  -.0269  .9  ,.92  -,02  .0004 


The  probable  error  in  any  ordinate  is  then 

■  r  =  •6745/frM 

=  .39 


and  the  curve  is  not  so  good  a  fit  as  the  parabolic  curve  with 
three  constants. 

2 

The  smallness  of  the  coefficient  of  x    suggests  that 
an  exponential  curve  with  two  constants  night  give  a  curve  which 
would,  fit  the  data  fairly  well.      The  fit  will  not  of  course  be 
perfect  for  the  plotted  curve    Y  =  f(x)  is  not  a  straight  line. 
This  solution  willbe  really  fitting  the  best  straight  line  to 
this  curve  and  then  taking  the  exponential  of  both  members  of 
the equation  expressing  the  result. 

Assume 


Y  = 


^o[eo  +  elf  ] 


Only  two  constants  are  to  be  determined  so  area  and  first  mo- 
ment are  sufficient.      These  were  computed  on  pages  lo3  -  Gi~ 
and  the    following  values  found. 
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LQ  =  1 

L«   —  — •  .353333 

By  the  general  formulas  for  e's  of  parabolic  curve,  page  fjf 

e0  =  L0  =  1 

ei  =  =  -1.059999 

yQ  =  1.50798  page 
1  a  10 

Y  =  1.50798(1  -   .1059999  x)  Plate  Va 

=  1.508  -   .1599  x 

,  _    1.508  -   .1599  x 

y'  =  e 

„  _  J.. 608  -   .1599  x 

y  —  e  —  o 

Comparison  of  the  values  of  y  computed  by  this  formula  with  ob- 
served values  in  the  following  table  shows  that  the  exponential 
formula  in  two  constants  gives  results  which  are  sensibly  coin- 
cident with  those  given  by  the  exponential  in  three  constants, 
while  of  course  its  derivation  is  much  le ss  laborious. 
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W  L>  O  W  J.  V  w  L.L 

n  nm  Tin  +*  p  rl 

v  UuU  I   V  ^  \X 

n  orn  nnt 

\J  will  J.^       W  v^ 

v 

Y 

y 

X 

<y 

<y 

o 

-10 

"*  X  \s 

5  0445 

3  1 07  0 

21  0 

2°  3  6 

1 

_  0 

2 .8959 

2 . 9471 

18.1 

19.06 

O 

_  s 

2  7600 

2  7872 

15  .8 

16  23 

-  7 

2  62^-6 

2 . 6273 

15  .8 

15  84 

A 

2  4932 

p  A  674. 

12  1 

x  r_-  *  x 

11  80 

XX    •  w  V 

-  5 

2  3702 

2  3075 

10.7 

10  54 

6 

2 .2407 

2  ,1476 

9.4 

8 .57 

7 

1 

2  07  94 

1 . 9877 

8.0 

7  30 

ft 
0 

_  p 

1  90°  1 

1  8^7? 

6  7 

6  22 

q 

**  JL 

1  7  047 

1   667  9 

5  5 

5  30 

1  0 

0 

1  ^-929 

1  5080 

4  45 

A 

X  X 

"I 

X 

1  2527 

1  3481 

3  5 

3  86 

X  rj 

p 

9932 

1  1882 

9  7 

1  3 
X  ij 

g 

.  91 G3 

1   02  83 

2  5 

2  79 

14 

4 

.8389 

.8634 

2.3 

2.38 

15 

5 

.6932 

.7085 

2.0 

2.03 

16 

6 

.5878 

.5486 

1.8 

1.75 

17 

7 

.4700 

.3887 

1.6 

1.48 

18 

8 

.2623 

.2288 

1.3 

1.26 

19 

9 

.0955 

.0689 

1.1 

1 .07 

20 

10 

-  .1054 

-  .0910 

.9 

.91 

\ 
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THE  SINE  CURVE. 


A  type  of  curve  often  useful  when  the  data  plots  a 
periodic  or  repeating  curve  is 

y  =  a  sin  nx  +  b  cos  nx 

which  is  simply  a  sine  curve  with  the  y-axis  shifted  horizon- 
tally to  a  now  origin.      For  if 

y  =  a  sin  nx 

The  transformation    y  -  y' 

x  =  x '  -f-  k 

gives  y  =  a  sin  (nx  -h  nk) 

=  a  sin  nx  cos  nk  -4-  a  cos  nx  sin  nk 


=  b  sin  nx  +  c  cos  nx 

by  collecting  all  constants  into  a  single  coefficient. 

If  a  curve  of  this  type  is  to  be  fitted,  to  determine 
the  three  constants,  a,  b  and  n  will  require  the  area  and 

two 

f irstAmoments .      Let    2  1  be  the  total  range  of  observations 


I 
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and  take  the  origin  at  the  mid-point  of  the  range.  Then  the 
area 

(1)    AM0  =  A  =  J    y  dx  =  ^-co3  nx)  +  ~  sin  nxj 

2b  , 
-  —  sin  nl 

n 


1  ni 

(2)    AM-,  =  I    xy  dx  =  a  /    x  sin  nx  dx  +  b  /    x  cos  nx  dx 
(-1  1-1  J-l 


x  cos  nx  +  — ~  sin  nx  +  ^x  sin  nx  -f-  cos 


n 


n2  n  n*2 


nx  / 


=  -  —  1  cos  nl  +  £j|  sin  nl 
n  nd 


AMg  =  a J     xS  sin  nx  dx  +  bl  ^x2  cos  nx  dx 


S  x2  cos  nx-f-2  !£«  x  sin  nx  +  cos  nx  +  —  x^  sin 
n  n^  n°  n 


x  cos  nx  -  iiS    sin  nx 


I2  sin  nl  t  S  1  cos  nl  -  1^  sin  nl 

n^  n3 


Dividing  (3)  by  l2  and  then  by  (1) 


71 


Equation  (4)  then  must  be  solved  for  z  and  since  1  is  known 


z 

n  =  7 

X 


From  (I) 


b  =  n  A  Uo      =    A  lioz 

2  sin  z         2  1  sin  z 


From  (II) 


-  A  i\w   -        -  A  M»  -  ( 

3  1  cos  z  -  £  sin  z  2(~  cos  z  -  ~_sin  z) 
n  Z  zz 


A  Iv'I  i   «  .    -  A  Ivi . 


2  1  sin  z  (-  cot  z  -  -1      2  B  1    sin  z 
z  z2 


Let  it  be  required  to  fit  a  curve  of  the  form 


y  =  a  sin  nx  -4-  b  cos  nx 


to  this  data. 
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x               y  x  y 

0  125.0  5  71.5 

1  145.6  6  -72.8 
3  138.2  10  -133.7 


To  apply  the  formula  deduced  in  the  last  paragraph 
the  origin  must  he  taken  in  the  middle  of  the  range  by  the 
transformation 

x»  =  x  -  5 

y1  =  y 

With  this  transformation  thedata  becomes 


x'  y  x'  y 

-5  125.0  0  71,6 

-4  145.6  3  -72.8 

-2  138.2  5  -138.7 


Since  the  ordinates  are  not  equidistant  the  quadra- 
ture formulas  of  pp.  3  5"--  4  ^  do  not  apply.      The  area  is  hence 
considered  as  a  succession  of  trapezoids.      The  area  of  each  is 
1/2  (x2  -  x1)(y1+  y2) 


i  &± 

> 

s  &* — > 
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For  the  first  moments  take  axes  as  in  figure. 


A  Li, 


"b 


x1  -  0 


Equation  of  line  is 


v  —  v 


X 


b 


y    =     $a~  7j  x  -|-  y 

b  1 


0  b      3        g  5 


~    =    &J|i  =    l3g(S,y2-^  y,  )  =    b(2y^  y,  ) 

+   78)       5(y,4  y2) 


A 


This  formula  for  first  moment  and  centroid    applies  also  to  f 
gures  in  which  part  of  the  area  considered  lies  above  and  a 
part  below  the  x-axis.      For  proper  trapezoids  the  radius  of 
gyration  is  nearly  enough  equal  to  x  for  purposes  of  this  pro 
blem  but  in  case  part  of  the  area  lies  above  and  a  part  below 
the  x-  axis  the  second  moment  must  be  calculated  for  the  two 
triangular  parts  separately. 

In  the  table  columns  marked  x  and  y  are  the  data. 
x1  =  x  -  5  to  bring  the  origin  to  the  middle  of  the  range 


b  =  x    -  x    is  the  base  of  the  trapezoid,  A  is  the  area, 


b  0 
3  B 
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is  distance  fro^:  left  hand  boundary  to  centroid  "by  formula  on 
the  previous  page,  x    =  xj  -\-  ~  . 


B 

G 

b  G 

X 

y 

x' 

yl+  Jfe 

D 

A 

71  2y2 

o  B 

X 

0 

125.0 

-5 

170.6 

1 

135 

.3 

416.2 

.51 

-4.49 

1 

145.6 

-4 

283.8 

2 

283 

.8 

422  .0 

.00 

-3.01< 

3 

138.2 

-2 

209.8 

2 

281 

.4 

281 .4 

.89 

-1.11 

5 

71.6 

0 

-1.2 

3 

-1 

.8 

-74.0 

61.70 

61.67 

8 

-72.8 

3 

•211.5 

2 

-211 

.5 

-350.2 

1  .10 

4.10 

10 

-138.7 

5 

415 

.6 

M&1  =  xA 

x2 

AM2  =  x2 

A 

607  .50 

20.16 

2727 . 65 

854.24 

9.06 

2571.23 

232. S8 

1.23 

258.05 

111.01 

867  .15 

16.81 

-3555  .32 

-  328.10 


-2672.78  1673.51 

Calculating  the  second  moments  of  the  two  triangular,  portions  of 

the  fourth  trapezoid  separately  and  adding  them  gives  -328.1. 

—2 

This  sum  is  added  in  the  last  column  in  place  of    x    A  which 
does  not  apply  to  figures  of  this  kind. 
By  definition 

Substituting    1=5  and  values  of    AMq  and  AMg  found  on  last 


page 
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B  =  -  .4-194 

Hence  the  equation  to  be  solved  for  z  is 

~  -  .4194  z  =  cot.  z 
z 

This  equation  Will  evidently  be  satisfied  by  simultaneous  so- 
lutions of 

(1)  y  =  i  -.4194  Z  and 

z 

(2)  y  =  cot  z  (  z  in  radians  of  course.) 

But  each  of  these  is  a  continuous  curve  and  hence  plotting  as 
in  accompanying  figure,  Plate  VII,  will  help  us  to  an  approxi- 
mate solution. 

If    z  =  1,  (1)  gives     y  =  .5  906,  (2)  gives    y  =  .6421 
■     z  =  2,  (1)       "         y  =-.3383,  (2)       '»        y  =  -.4576 

Plotting  these  shows  that  there  oust  be  an  intersection  and  hence 
a  value  of  z  between  1  and  2. 

Taking    z  =  1.5,  (1)  gives    y  =  .0375,  (2)  gives  y  =  .0709 
Hence  z  lies  between  1.5  and  2,  and  apparently  much  nearer  1.5. 

Taking    z  =  1.6,  (1)  gives    y  =  -.0461,  (2)  gives    y  =  -.0292 

Hence  z  lies  between  1.6  and  2  nearer  1.6 
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Taking    z  =1.7,  (1)  gives    y  =  -  .1247,  (2)  gives  y  =  -.1299 
Hence  z  lies  between  1.6  and  1.7,  nearer  1.7 

Taking    z  =  1.67,  (1)  gives    y  =  -.1016,  (2)  gives  y  =  -.0995 

Hence  z  lies  between  1.67  and  1.7,  nearer  1.67. 

Taking    z  =  1.68,  (1)  gives    y  =  -.1094,  (2)  gives  y  =  -.1096. 

Hence  z  lies  between  1.67  and  1.68,  nearer  1.68 

Taking    z  =  1.679,  (1)  gives    y  =  -.1086,  (2)  gives    y  =  -.1086 

and  this  is  the  value  of  z  sought. 

By  hypothesis    z  =  nl  =  5n 
and  hence  n  =  .3358  =  19°  14'  23". 

From  formula  I , 

■b  =  A  ..:0  z  =    415.6  x  1.679  __  ?0  0 

2  1  sin    z  10  x  .9942 

From  II , 

'  _        -  A  Li i         _    -  2672.78 

2  B  l2sin  z    "  50  x  .4194  x  .994*2 

=    -  128.2 

Hence  the  equation  of  the  curve  is 

y  =  -  123.2  sin  £x(19°  14'  23")J  -f-  70.2  cos  ^x(19°  14»  23tt)j 

The  table  shows  the  degree  of  accuracy  of  fit  of  the  curve  to 
data. 
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Observed         Computed         Difference         %  error 
x         x*  y  1 


0 

-5 

125.0 

119.9 

5.1 

4.1 

1 

145.6 

141.7 

3.9 

2.7 

3 

138.2 

134.7 

3.5 

2.6 

5 

0 

71.6 

70.2 

1.4 

1.9 

8 

o 

-  72.8 

-  70.9 

1.9 

2.6 

10 

5 

-138.7 

-135  .0 

3.7 

2.7 
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THE  HYPERBOLIC  FORM. 


Analogous  to  the  parabolic  curve  in  a  power  £eri:;s  of 
positive  powers  of  x  is  thepyperbolic  curve  in  negative  powers 
of  x.      This  curve  is  of  the  form 


This  form  of  function  is  far  less  convenient  to  deal  with  from 
the  stank  point  of  integration  than  the  parabolic  form  for  the 
reason  that  any  transfer  to  new  axes  alters  the  form  of  ths 
function  and  not  merely  the  constants  involved  in  it.      It  is 
then  studied  here  partly  as  an  illustration  of  the  fact  that 
one  can  not  always  shift  the  axes  about  to  suit  his  convenience 
and  still  retain  the  same  form  of  function.      Using  so  far  as 
possible  the  same  notation  as  in  the  preceding  problems, 

Let  21  =  ths  range  on  the  x-axis. 

k  =  the  lower  limit  of  range . 
Then    k  +  2  1  =  the  upper  limit  of  range. 


y  =  y0(  a0+a,+  at+  ) 
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where 


J-  .  /oa  £+zl 
ii     AM,  *    J    *y      =  y0  1-jfX  +  *z 

xy^x  =/e      /V  >  — -y 


Since    A  =  2  Ijtq 


I  M0  =  an  +   a-,  F  f      a2  ,  

k(kf2  1) 

II  fiL  =  ao(k-/-l)  V*       ■+  a^F 

III  M0  =  f°(4  l2  -/-  6  kl  +  3  k2)  -f  an  ( k  +  l)  +  &  4 

^3  J-  2 


It  will  be  noticed  that  the  complexity  of  the  formulas  increases 
rapidly  with  the  number  of  constants. 

A  curve  of  this  form  in  two  constants  will  be  fitted 
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to  some  data  obtained  from  an  experiment  on  Boyle's  Law,  for 
which  a  more  perfect  fit  was  desired  than  that  given  "by  the 
first  term  as  ordinarily  expressed 

xy  =  k 
or  y  =  k/x 

The  assumed  function  is 

y  =  y0(ao+  -1) 

-  x 
In  the  following  table  columns  marked  x  and  y  give  the  data,  x 
the  volume,  y  the  pressure.      Since  theordinates  are  not  equi- 
distant, the  commonly  used  quadrature  formulas  fail,  and  the  a- 
reas  will  be  considered  a  succession  of  trapezoids.      b  =  the 
base  of  each  trapezoid  =  x0  -  x-^  ,  etc.,  as  in  tabulation  of 
areas  and  moments  for  the  sine  curve. 


B 

A 

G 

bG 

A 

X 

y 

b 

yi+sy2 

3B 

X 

29.0 

118.5 

•  9 

233  .3 

104.98 

348.1 

.45 

29.45 

3  091.661 

29.9 

114.8 

1.0 

225.8 

112.90 

33  6.8 

.50 

30.40 

3432.760 

30.9 

111.0 

1.2 

217  . 9 

130.74 

324.8 

.60 

31 .50 

4118.310 

32.1 

106.9 

1.4 

209.3 

146.51 

311  .7 

.70 

32.80 

4805.528 

33.5 

102.4 

1.3 

201.3 

130.90 

300.2 

.  65 

34.15 

4470.235 

34.8 

98.9 

1  ? 

194.6 

116.76 

290.3 

.60 

35.40 

4133.304 

'  36.0 

95.7 

1.4 

187.7 

131.3  9 

279.7 

.70 

36.70 

4822.013 

37.4 

92.0 

1 .5 

180.6 

135.45 

269.2 

.75 

38.15 

5167.418 

38.9 

88.6 

2.3 

174.2 

200.33 

259.8 

1.14 

40,04 

8021.213 

41.2 

85.6 

2.5 

164.2 

205.25 

242.8 

1 .23 

42  .43 

8708.758 

43.7 

78.6 

2.1 

153  .7 

161.88 

228.8 

1.04 

44.74 

7242.511 

45.8 

75.1 

1577.09 


58013.711 


* 
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A  =  1577.09 


V  1 


u    -  53015.711  _  _r 

i;il  -    1577.09    "  36 -7853 


2  1  =  45.8  -  29  =  16.8 


1  =  8.4 


=  93.87 


^0 

k  =  29 

k  +  1  =  37.4 

F  =  —  .  log^*2  1  =  -°272 
2  1  k 

Using  formulas  I  and  II  to  determine  the  constants 

M0  "  a0  +  alF 

=  a0(  k  4-  1)  +  ax 

(3)  ai  =  MQ  "  aQ    =  LL  -  a0(k  +  1) 

F 

MQ  -  a0    =  FM1  -  Fa0(k  -h  l) 

(4)  aQ  =  S&  I  M0 

u      F(k  +  1)  -  1 

Substituting  values  in  (4) 


IS.0 
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a    =  *0272  x  56.7855  -  1  =  0394 
0       .0272  x  37.4        -  1 

Subs ti tut ing  in  (5) 

■ 

a;L  =s  56.7853  -   .0324  x  37.4 
=  35.5732 
Hence  the  curve  is 

y  a  93 .87 ( .0324  t  35.5752) 

x 

=    2.94  +  533  ?,*26  Plate  VIII 

x 

The  following  table  shows  the  closeness  of  fit  of  the  theoretical 
and  experimental  curves. 


Observed         Computed         Difference         %  error 


X 

y 

y 

29.0 

118.5 

118.1 

.4 

.3 

29.9 

114.8 

114.6 

.2 

.2 

30.9 

111.0 

111.0 

0 

0 

32.1 

106.9 

107.0 

-.1 

.1 

55  .5 

102.4 

102.6 

_  9 

.2 

34.8 

98.9 

98.9 

0 

0 

3  6.0 

95  .7 

95.7 

0 

0 

37.4 

92.2 

-.2 

.2 

38.9 

88.6 

88.8 

.2 

41 '.2 

85.6 

84.0 

1.6 

1.9 

43  .7 

78.6 

79.3! 

-.7 

.9 

45  .8 

75  .1 

75.8 

.9 

Average  .4 


33. 

The  point  whose  coordinates  are     (41.2,  85.6)  gives 
a  %  error  7  1/2    times  as  large  as  the  average  of  all  the  rest. 
This  fact  with  theevident  break  made  "by  this  point  in  the  obser- 
vation curve  leads  one  to  suspect  an  error  in  the  observation. 
Nevertheless  the  curve  fits  very  well, and  .4  %  average  error  for 
a  function  in  two  constants  shows  the  suitability  of  the  form 
of  function  assumed. 

In  summing  up  the  relative  advantages  and  disadvantages 
of  the  methods  of  moments  and  least  squares  these  seem  to  be  the 
principal  points. 

1.  The  method  of  least  squares  will  so  determine  the 
constants  of  an  assumed  f one t ion  as  to  fit  as  well  as  possible 
the  plotted  points,  but  not  necessarily  the  polygon  formed  by 
connecting  these  points. 

2.  More  depends  on  the  choice  of  form  of  function 
than  on  any  great  refinement  of  accuracy  in  determination  of 
constants  in  a  function  of  unsuitable  form.      Since  the  method 
*of  moments  applies  to  a  greater  variety  of  forms,  it  has  in  this 
respect  a  decided  advantage. 

3.  Even  in  the  parabolic  form  to  which  the  method  of 
least  squares  is  most  easily  applicable  ,  the  method  of  moments 
gives  results  sensibly  as  good  with  less  labor,  and  with  perfect 
quadrature  formulas  will  give  as    good  results. 

For  these  reasons,  it  seems  that  for  the  practical 
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problems  of  the  physicist  and  engineer  the  method  of  moments  may 
"be  as  useful  as  Prof.  Pearson  has  already  shown  it  in  statisti- 
cal work. 
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SPECIAL  METHODS. 


The  advantage  in  certain  cases  of  fitting  a  curve  to 
the  logarithms  of  data  instead  of  directly  to  the  data  has 
been  discussed  and  illustrated  in  a  preceding  section  of  this 
paper,  and  the- use  of  logarithms  as  means  of  fitting  curves  of  the 
the  form 

y  =  kxn 

will  now  he  considered. 

Taking  logarithms 

log  y  =  log  k  +  n  log  x 

This  is  evidently  a  linear  equation  in  two  variables    log  x  and 
log  y    with  slope  n  and  y- intercept    log  k.      If  then  on  plot- 
ting logarithms  of  any  set  of  data  the  graph  is  a  straight 
line ,  the  data  can  be  fitted  by  a  curve  of  the  form 

y  =  kxn 

and  measuring  the  y- intercept  gives    log  k    and  the  tangent  of 
the  angle  made  with,  the  x-axis  is  the  value  of  n.      As  a  mecha- 
nical device  to  avoid  the  labor  of  looking  up  logarithms,  lo- 


X-<7X  I  S 
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logarithm  paper  may  be  used.      It  differs  from  other  coordinate 
paper  in  that  the  length  of  the  coordinate  corresponding  to  the 
number  printed  in  the  margin  is  the  logarithm  of  that  number  in- 
stead of  the  number  itself.      The  spacing  of  the  lines  is  thus 
exactly  like  that  on  a  slide  rule.      The  origin  is  marked  (1,1) 
for  the  logarithms  of  these  numbers  are  (0,0).      If  then  the  da- 
ta be  plotted  on  logarithm  paper,  the  mathematical  relations  in- 
volved are  exactly  the  same  as  if  the  logarithms  of  the  data 
had  been  plotted  on  ordinarry  coordinate  paper.      In  the  accom- 
panying figure,  Plate    IX,  are  plotted  data  of  an  experiment  on 
flow  of  water  through  an  orifice.      x  is  the  head  in  feet  and 
y  is  the  discharge  in  cubic  feet  per  second. 

x  y  x  y 

.7  6  .377  1.41  .520 

.89  .415  1.54  .547 

1.02  .441  1.67  .5  65 

1.15  .464  1.30  .589 

1.28  .495 

Drawing  a  line  V/hich  will  fit  these  points  as  closely  as  possible  , 

its  y- intercept  is  found  to  be  .435  and  its  slope  (the  ratio 

of  its  horizontal  and  vertical  projections)  to  be  .525.  Hence 
the  data  can  be  fitted  by 

y  =  .435  x*525 


r/oyv  of  wafer 
thru   a  4'  orifice. 

o  Observed. 
— 0 — Com  puied. 


54 
52 
So 

4f 


a. 


u 


41 


42 


3Z 


■3L 


.7 


Head  m  fi. 

1.0         n        U        IS  fS       ^       17  (■* 
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The  table  shows  observed  and  computed  values  of  y 
for  each  value  of  x,  Plate  IX. 


Observed 


Computed 


Difference 


X 

y 

y 

.76 

.377 

.377 

0 

.89 

.415 

.409 

.006 

1.02 

.441 

.440 

.001 

1.15 

.464 

.468 

-.004 

1.28 

.494 

.495 

-.001 

1.41 

.520 

.521 

-.001 

1.54 

.547 

.546 

-  .001 

1,57 

.5  65 

.5  69 

- .  004 

1.80 

.589 

.5  92 

-.003 

%  error 


4 
2 
9 
2 

2 

2 
r 

.5 


As  will  be  seen  from  the  table  the  fit  of  the  curve 
is  very  close ,  probably  within  the  limit  of  accuracy  of  the  ob- 
servations.     It  will  be  seen  that  the  formulas  here  derived 
differ  slightly  from  the  one  commonly  used. 


Where  c  is  a  constant  for  any  particular  orifice  and  value  of 
x  must  be  empirically  determined  and  tabulated  for  each  ori- 
fice over  such  range  of  variation  in  head  as  are  likely  to  oc- 
cur in  connection  with  that  orifice.      Evidently  for  the  experi- 
ment  here  worked  out    c  =  .435  x*  ~° 

In  the  problems  to  which  this  method  is  applicable 
it  forms  a  simple  and  valuable  point  of  attack  for  determining 
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an  unknown  oxponent  and  nonce  supplements  the  method  of  least 
squares  and  the  method  of  moments  at  a  point  where  both  fail  by 
leading  to  equations  too  complex  to  be  solved.       It  is  especially 
valuable  in  hydraulic  formulas  ,  though  it  applies  of  course 
wherever  the  function  takes  the  form 

y  =  kx" 

as  in  a  period  of  a  pendulum,  relation  of  stress  to  deformation 
in  concrete ,  etc . 


Another  use  of  logarithm  paper  in  carve  fitting  ap- 
plies to  functions  of  the  form 

(1)  y  -  a  =  k(x  -  b)n 

a  form  which  has  four  constants  and  has  them  so  disposed  as  to 

give  great  flexibility  to  the  curve. 

Differentiating 

dx  =  nk(x  -  b)u 

a  form  which  if  b  were  eliminated  would  be  exactly  like  that  of 
the  last  paragraph.      The  value  of  b  may  be  found  as  follows: 
Plot  the  data  and  connect  consecutive  points  by  straight  lines. 
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Then  if  x    and  y    are  the  coordinates    of  the  rth  point  and 
r  r 


x 

r 


^  ^  and  7r  +  i    those  of  the  (r+l)th,  the  slope  of  the  line 

joining  these  two  points  is    ^r+1  ~  ^r    which  is  a  fair  approxi- 

,         ^r+"l  ""  ^r 

mat ion  to  the  value  of        at  a  point  on  the  curve  mid-way  be  - 

dx 

tween  the  two.      Gall  the  abscissa  of  this  Doint  <x    ,  ,  /0 . 

rf  1/2 

Evidently  from  (2) 
dv 

dx"    =  o 


when    x  =  b.      If  then  corresponding  values  of  xr.j_-|y0  and 

i|y  be  plotted,  the  intersection  with  the  x-axis  will  give  the 
dx 

value  of  b.      In  some  cases  the  curve  will  not  intersect  the 
x-axis  within  the  range  of  obcervations .      In  such  cases  it 
must  be  prolonged  until  it  does  intersect.      There  is  here  of 
course  a  large  chance  for  error  but  if  b  be  taken  incorrectly 
the  fact  will  appear  in  the  next  step,  so  the  process  as  a  whole 
is  self-checking.      After  b  is  determined  the  function  is  of  the 
form,  previously  studied, 


y  =  kx 


n 


where  y  is  replaced  by  ^ 


x  "  »  "  x  -  b 


k  "  "          ■  nk 

n  "  "  "    n  -  1 
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Corresponding  values  of       and    x    .  -  /    -  "b    are  then  plotted 

dx  r-ri/d 

on  logarithm  paper.      If  they  plot  a  straight  line  b  has  been 

correctly  chosen.      If  they  form  a  regular  curve  b  is  in  error. 

If  the  points  are  irregularly  arranged  this  form  of  function 

is  not  .veil  adapted  to  the  data.      In  making  successive  trials 

to  determine  b  it  is  useful  to  note  that  a  value  of  b  which  is 

too  large  makes  the  curvD  convex  on  one  side,  and  a  value  too 

small  makes  it  convex  on  the  other  side.      If  two  such  curves 

have  been  found  it  is  then  known  that  the  true  value  of  b  lies 

between  the  two  which  gave  these  curves  and  its  value  may  be 

approximated  by  successive  subdivisions  of  the  interval.  When 

a  value  of  b  is  found  which  does  make  and  (x      ,  /_  -  b) 

,  r+-l/2 

dx  ' 

plot  a  straight  line  on  logarithm  paper,  the  y- intercept  of 
this  line  gives  the  value  of  nk  and  the  slope  the  value  of 
(n  -  1).      These  two  equations  may  then  be  solved  for  k  and 
n.      All  the  constants  of  tho  original  function  are  now  deter- 
mined except  a,  and  it  may  now  be  found  by  substituting  the 
data  into  this  equation.      This  substitution  forms  a  check  on 
the  accuracy  of  fit.      A  perfect  fit  would  show  the  same  value 
of  a  for  each  pair  of  values  of  x  and  y.      This  of  course  is 
not  to  be  expected  and  an  average  value  of  a  is  taken  and  the 
error  in  any  y  is  the  difference  between  the  value  of  a  com- 
puted for  that  particular  point  and  the  average  adopted  for  the 
formula. 


G 

mph  o 

r 

d 

X 

5  5/  pp 

/7  w  n 

u  ri  U 

/  Mi  V 

t 

Data. 

.  Al  - 

1 

b 

=  3 

■J 


3  ^ 

a 


Co 


5 

Jo 

7 


.9 


fraction  of  total  depih.  P/at&Yl 
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From  the  graphical  nature  of  the  process  ,  it  is  not 
adapted  to  data  using  a  large  number  of  significant  figures. 
Hence  in  the  problem  treated,  namely,  the  same  one  that  was 
treated  by  least  squares  and  by  moments,  the  velocity  of  the 
of  the  Mississippi  River  at  varying  fractions  of  its  total  depth, 
the  velocities  are  taken  only  to  three  significant  figures. 

In  the  table  x  and  y  are  the  data  and  they  are  tabu- 
lated on  every  other  line  so  as  to  leave  space  for  writing 
in  the  corresponding  values  of  x      ,  t0    and    QZ.      The  latter 

r-T     3./ d  ^y. 

is  the  ratio  of  the  y- increment  to  the  x- increment  in  and  since 

the  x- increment  is  uniformly     .1,  ~%  =  increment  in  y  multiplied 

by  10.      Plotting    xr+i/2    an^    "f^  »  Plate  XI,  gives  a  line 

whose  x-intercept  is  .3  which  is  therefore  the  value  of  b. 

The  column    x  -  b    is  then  filled  out  and  corresponding  values 

of    ^    and    x      1  /o    are  plotted  on  logarithm  paper,  (Plate 
dx  r  + 

XII).  Since  there  is  no  logarithm  of  a  negative  number  only 
absolute  value  is  considered  and  the  sign  is  taken  account  of 
afterwards  as  always  in  working  with  logarithms. 
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0 


X 

y 

3  .20 

dx 

x  -  b 

.05 

.3 

-2.5 

.1 

3  .23 

.15 

.2 

-1.5 

.2 

3.25 

.25 

.1 

-  .05 

.5 

3  .26 

.35 

-.1 

.05 

.4 

3.25 

.45 

-.2 

.15 

.5 

3  .25 

.55 

-.5 

.25 

.6 

3.18 

.65 

-.5 

.35 

.7 

3  ,13 

.75 

-.7 

.45 

.8 

3.06 

.85 

-.8 

.55 

.9 

2.98 

Plotting  corresponding  values  of    -        and    xr -n/g     on  logar- 
ithm paper  and  drawing  the  best  straight  line  to  represent  this 
data  gives  the  y- intercept  =  nk  =  1.26  and  the  slope  =  n-1  =  .846 

n  =  1.846 
and  k  =  .683 

Taking  account  now  of  the  fact  that  ^    and    x  -  b    had  opposite 

dx 

signs  we  have  really  determined  the  constants  in 

a  -  y  =  k(x  -  b)n 
and  substituting  values  of  k,  b  and  n 

a  -  y  =  . 683 ( x  -  ,3 )  * 


Curve   showing   relati  on  of 

Velocity  and  Dept  h 


Mississippi  River. 


Y=  l2tf+  .(*U(x-3) 
-  Observed 

O  C orrt  pvitd. 


3.3  0 
3.3,5" 

32  0  ^ 

3/5"  v 

3.10  5 

,s 

3.05"  £ 


3.0  0 


rracf/on  of  fota/  depfh.^  PlafeXIIl. 
o      ./      3      .3      *      s     Z      f    1      f  ti 


where  absolute  value  and  not  sign  of    (x  -  .3)  is  con- 

sidered. 

Corresponding  values  of  x  and  y  substituted  in  this 
equation  gave  the  values  of  a  indicated  in  column  a  of  the  ta- 
ble.     The  error  in  the  value  of  y  arising  from  adopting  the 
average  value  of  a ,  that  is  the  difference  between  an  observed 
y  and  one  computed  by 

£.259  -  y  =  .683(x  -  .3:)1'846 

is  tabulatedin  column  marked  v. 


X 

y 

a 

v  % 

error 

3  .20 

3.274 

.015 

.5 

.1 

3.23 

3  .265 

.006 

o 

.2 

3  .25 

3.260 

.001 

!o 

.3 

r?      <*>  C 

O  .  C  D 

3.260 

.001 

.0 

.4 

3.25 

3  .260 

.001 

.0 

.5 

3.23 

3.265 

.006 

.2 

.6 

3,18 

3  .254 

-.005 

.2 

.7 

3  .13 

3  .256 

-  .003 

.1 

.8 

3  .06 

3.250 

-  .009 

.3 

.9 

2.98 

3  .246 

-.013 

jA 

Average 

3.25  9 

Average 

.2 

Plate  XIII 


The  elimination  of  a  and  b  as  performed  in  this  pro- 
blem is  due  to  Prof.  Langsdorf  and  has  been  published  by  him  in 
The  Journal  of  the  Association  of  Engineering  Societies.  The 
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problem  has  "been  taken  up  here  as  an  illustration  of  the  pos- 
sibilities in  the  use  of  logarithm  paper.      It  is  especially 
useful  just  where  other  methods  of  curve-fitting  fail,  namely 
in  finding  of  an  unknown  exponent. 
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ON  THE  FORI.;  OF  FUNCTION  TO  BE  ASSUMED. 


While  this  paper  has  so  far  dealt  with  the  method  of 
fitting  a  previously  assumed  function  to  certain  data  by  a 
suitable  determination  of  its  constants  the  important  question 
as  to  the  choice  of  form  of  function  remains  unanswered.  Con- 
cerning thefive  types  of  function  suitable  for  frequency  curves 
mentioned  on  page  13,  Prof.  Pearson  has  set  up  analytical  cri- 
teria to  determine  which  should  be  chosen  in  any  particular 
case.      It  would,  be  of  great  importance  if  analogous  tests 
could  be  set  up  for  curves  suited  to  engineering  and  physical 
data,  but  little  has  been  done  in  studying  this  problem. 
While  nothing  approaching  a  systematic  treatment  of  this  whole 
subject  is  attempted  here,  there  will  be  presented  certain  tests 
for  some  of  the  commoner  functions  based  or  the  calculus  of 
finite  differences. 

Let 

y  =  f(x) 

and    (y0>xQ)>  (^i*  *0+h)»  (y2,x0+2h)  (yn,  x0+nh) 

be  corresponding  values  of  x  and  y  taken  at  equal  distances 
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along  x-axis. 
Lst 


=  J't  - 


d2o  = 


Then 


D 


2  _ 


 D. 


t  — 


k  +1 


—  v 


k 


y2  -  2yx  +  y0 


D2  =  DT       -  d; 

k       k    1  k 


D3  =  D2  -  D2 
Olo 


=  y*  -  sy?  +  3yn  -  y 


o 


Kence  since  successive  coefficients  are  formed  by  same  addition 
as  coefficients  in  powers  of  a  binomial, 


=  y„  -  ry 


r  -  1 


r(r  -  1) 
21 


yr  -  2 


-  -  -  to  (r  4-1)  terms 


D«  =  D1  ,  +•  D2 
s        s-1  s-1 


=  D'      -•-   2D2      +  D3 

s-2  s-2  s-2 


=  D'      +  3D        +  3D        +  D4 

s-o  s-3  s-5  s-3 


And  in  general 


r  r 
D    =  D 


4-  D 


r  +1 

3-1 


=  Dr 


r  +1 

+-  2D     0     4-  D 
s„o  s-2 


to  (r  1)  terms 
r  +2 


=  Dr  -t-sD^1+s(s-l)D^  +  2  +  

0  0      ^    0 
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This  discussion  will  now  be  applied  to  a  parabolic  curve  , 

y  =  aQ  +  ajX  4.  a?x  -) —  -  +  anx* 
Since     (x0,yQ),  (x0+h,y  )  -  -  -  (xQ+nh,yn)     are  on  the  curve 

^0  =  a0  +  alx0  +  a2xo  +  +an^ 

y-L  =  aQ  +  a^XQ-t-  h)  +  a?(xQ+  h)2+  -  -  -    an(x0  +  h)n 


yn  ~  a0  +  al^x0+        *  a2^x0  +nil^2  +"  ~  "  ~  an^x0  ~*~ 

D0  =  alh  +  a2(2x0h-l-h2)  +  ^anf  x0  *  ^  "  x  J 

=  a1h  +-  a9(2x0h-h3h2)+  +  an  j(x0 -f- 2h)n  -  (x0-Mi)^ 


All  first  order  differences  are  therefore  of  degree  (n-1)  in 
x  and  are  all  independent  of  aQ.      Moreover  the  first  term  in 
each  first  order  difference  is  a-[_h.      The  firstterra  in  the  co- 
efficient of  any  a  as  a0  is  rx^-^h. 

<^  (J 

Therefore 

D2  =  2aQh2  +   

0  6 

of  degree    n  -  2  in  x  and 
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D0  =  nt  anhn 

for  each  order  of  differences  drops  an  a  from  the  "beginning  and 
one  degree  in  x. 

Therefore  if  a  set  of  data  is  to  "be  exactly  fitted  by  a  curve  of 
the  parabolic  type  of  nth  degree,  the  nth  order  of  differences 
is  constant  and  accordingly  higher  order  of  differences  vanish. 

It  will  now  be  shown  that  this  condition  here  proved 
necessary  is  also  sufficient.      By  definition  of  this  notation, 

yi  =    +  D6 

y2  =  yl  *  Dl 

=  yQ  +  2DJ  +  D2 

y3  =  y2  +  D2  =  yo  +  3Di+3Do  +  D° 

Thus  the  coefficients  are  apparently  those  of  the  binomial  the- 
orem.     It  will  be  assumed  that  y.     has  such  coefficients  and 

k 

proved  that       ,^    will  have. 

n     yk  ■  y0  +  kD'  *  1]   +  D* 

Then  yk  ^  =  yk  +  D' 
From  I 

D;  =  D'  +  kD2  +■  k(k  -  1)  D3  +  

k       0         o   «i   0 
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Adding  II 


+  (Mk  -  1)  +  k(k  -  l)(k  -  2))Dr  -  1  +  

,  fk(k-l)  (k-rf3)  .  k(k-l)  -  -  (k-r ->-5)Lr-l 

=  y_  +  (k-hl)D*  +  (k  fl)k  J?  .  (k-M)k(k-l) 

+  r k(k-l)  -  -  (k-r  +  5)(k-r +  a.+  r-l)l  _wl 
L  r-ll  JDo 

+  (k+l)k(k-l)  (k-r-f5) 

T  r  -  li  D0"+ 


which,  is  the  form  of  corff icients  of  "binomial  expansion  of 
(k+l)st  power. 

Hence  the  formula  holds  for    k  f 1     if  it  does  for  k.      But  it 

does  hold  for    k  =  3  ,  hence  for  all  values  of  k. 

Then 

i'r  =  y0  +  rD6 +  z^rt  +  


But    x    =  xn  +  rh 
r  0 


r  —      ■*■  \j 
h 


r  -  1  =  fr  10^.J^1  ,  etc. 

h 
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=   y     +    (Xr  -   XQ)  (XP  -  XQ^    (Xp  -   XQ^Jl)  . 

(xr  -  xo)     (x^  -  x0  -  h)  (x    -  x0  -  2h)  _ 

h  2h  3h  ° 

But  xp  and  yp  are  any  corre spending  values  of  x  and  y,  hence  may 
be  the  running  coordinates.      Hence  if  differences  beyond  the 
nth  vanish,  y  is  expressed  in  constants  and  integral  powers  of 
x  not  extending  beyond  the  nth  degree. 

Hence  a  necessary  and  sufficient  condition  that  cer- 
tain data  may  be  fitted  by  a  parabolic  curve  of  nth  degree  is 
that  the  (n-fl)st  order  of  differences  vanish.       If  the  differ- 
ences do  not  all  vanish  but  approximate  that  condition  then  the 
curve  will  be  an  approximate  fit. 

To  illustrate,  the  10  points  (-3,0),  (-2,-15),  (-1,0), 
(0,9),  (1  ,0),  (2,-15),  (3,0),  (4,105),  (5  ,384),  (6,945)  can 
be  exactly  fitted  by  a  fourth  degree  parabola,  as  fifth  order 
of  differences  vanish. 


<8 

i 

L 

y 

0 

-15 

0 

9 

0 

-15 

0 

105 

384  945 

D* 

-15 

15 

9 

—  9 

-15 

15 

105 

279 

5  61 

D2 

30 

—  o 

-18 

-6 

30 

90 

174 

282 

D3 

-36 

—  X/d 

12 

36 

60 

84 

108 

D4 

24 

24 

24 

24 

24 

24 

D5 

0 

0 

0 

0 

0 
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Successive  orders  of  differences  in 

y  =  a  sin  nx  +  b  cos  nx 

may  "be  calculated  "by  first  getting  differences  in  the  simpler 
functions 

y  =  sin  nx         and  y  =  cos  nx 

x  will  be  given  successive  increments  of  unity  and  y  will  assume 
the  values 

sin(nx)  ,  sin(nxfn),  sin(nx-fSn),  -  -  -  sin(nx-t-kn) 


D'  =  sin(nx+n)  -  sin(nx) 
0 


2  sin  S  cos(nx-)-~) 


2  sm  —  sm(nx-i  — ) 


D.j  =  sin  (nx+2n)  -  sin(nx+ n) 


2  sin  £(cos  + 

p.  2 


-2  sin  £  sin(nx  +  ^±2L) 
2  2 

=  sin^ix  +  (r+l)nj  -  sin  (nx-frn) 

=  2  sin  £  cosfn::4>(2rfl)  n] 
2         L  2  J 

=  2  sin  £  sinfnx  +  J&±IljUJB:J 
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D2  =  Di  -  D& 


=  2  sin  £  jsin(nx      3n         -  sin(nx  ±  n  t^77")] 

=  2  sin  2  /sin  £  cos  2lL±2T.  \ 
21       2  2  -* 

-  /o         n>2     -   /      ,  2n-t-2  7T. 

-  (2  sm  -)    sm(nx  -f  ) 

2  Z 

D2  =  (2  sin  £)2  ccsL  +  (ar  +  2)n»7T  7 
r  2  &  J 

=  (2  sin  £)2  sin/nx  +  (8r»2)n4  2/r] 
2  *■  ?.  J 


as  is  found  by  taking       from  D^^.-^. 

Similarly  by  operating  on  thegeneral  term  are  found 


D3  = 

(2  sin  £)3 

r 

2 

J)m  = 

(2  sin  £)3 

r 

2 

For      y  = 

cos  (nx) 

successive 

values  of 

cos  (nx)  ,  cos  (nx+n),  cos  (nx+2n),  -  -  -  cos(nx-f-kn) 


=  cos  (nx  +  n)  -  cos  nx 


=  -2  sin  —  sin  (nx-f-U) 

2  2 

=  2  sin  §  cos  (nx  + 

2  <d 
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=  cos  (nx>2n)  -  ccs(nx  +  n) 

=  2  sin  -  cos(nx  +  ^±E) 
2  2 


D'  =  2  sin  £  cos/nx  4-  (gr  +  pn  +  TT  1 
r  2        L  2 

D2  =  2  sin  life  sin  §  cos(nx  4- 
r  2  L  2 

fn     •     n\2         f         (2r+2)n  4-  2  7T 7 
=  (2  sin  L1)     cospx  H  j 

D3  =  (2  sin  -)3  cosfnx  +•  <2g  +  8)n*  37rj 
r  2  I  2 

Dm  =  (2  sin  n)m  cos/nx +  ( 2r  + m)n  + 
r  2  1  2  J 

Substituting  the  values  of  these  successive  differences  in 

y  =  a  sin(nx)  -f  h  cos(nx) 

D'  =  afs  sin  £  sin(nx-+  n  +  Tr)(  4-  b)2  sin  §  cos(nx+  n  t77")/ 
0        L  2  2      J  2  ^  J 

=  2  sin  §/a  sin(nx+  £  +  yr )  +  b  cos(nx  +  H-tH)  [ 
2L  2  2  J 

D»  =  2  sin  gfa  sin(nx  +  M+IT)  +.  b  cos(nx  +  — /^)] 
X  2  L  2 

^>  _  o     •     nf      ■   (        (2r  +  l)n  +  7T  ,  ( 2r 1  )nv-/r/ 

Df  =  2  sm  —la  sin(nx  +  1 —  )  +  b  cos  nx+  - — — —  )/ 

r  2L  2  ?  J 

D2  =  (2  sin  £)2[a  sin(nx  +  2£±®T)  +b  cos(nx  4-  2n£27r)J 

D2  =  (3  sin  £)2ra  sin  (nx  +  (8r  +  2)ntzg  ^  b  Cos(r4gr*2)n'g7i 
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lP  =  (2  sin  £)m  afsin(nx  -h  li'(n4"7r))  -+  b  cos(nx  4-  m(n  7 
0  2         L  2  2  J 

D    =  (2  sin  ~)    a  sm(nx  +•     .  .T  ''^^v-'v )-f  b  oos(nx   u — —  ) 


The  result  shows  that  a  necessary  condition  that  a  set 
of  data  may  be  exactly  fitted,  ^oy  a  curve  of  the  form 

y  =  a  sin(nx)  -f-  b  cos(nx) 

is  that  successive  orders  of  differences  be  also  sine  curve 
functions  of  period  the  same  as  the  original  data.      The  vortical 
scale  is  changed  by  successive  powers  of  the  constant    2  sin  — 
and  the  initial  point  on  x-axis  is  shifted  by  the  introduction  of 
the  term    ( +  sOn  but  the  essential  fact  of  a  repeating 

function  of  same  period  persists. 

So  far  as  is  known  to  the  writer  tests  for  the  various 
forms  of  function  suited  to  engineering  and  physical  problems 
have  not  been  worked  out.      The  problem  is  an  important  one  and 
the  calculus  of  finite  differences  seems  to  be  the  most  pro- 
mising method  of  attack. 
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